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(54) Title: LENGTH DETERMINATION OF NUCLEIC ACID REPEAT SEQUENCES BY DISCONTINUOUS PRIMER EXTENSION 



(57) Abstract 

Disclosed is a method for determining the number of re- 
peat units in a repeat region of a target nucleic acid. The method 
comprises a discontinuous primer extension reaction wherein a 
primer is extended in discrete increments corresponding to a sin- 
gle repeat unit. Following each increment of primer extension, a 
detection step is performed in which a modulation in a signal is 
detected when the primer has been extended by an amount equal 
to the total length of a repeat region. By counting the number 
of increments of discrete primer extension required to cause a 
modulation in the signal, the number of repeat units making up 
the repeat region is determined. In the method, a plurality of 
different-sequence primers are contacted with a polynucleotide 
sample under conditions effective for the primers to anneal to 
primer-complementary regions in one or more target polynu- 
cleotides, to form one or more target-primer hybrid(s), wherein 
either (1) each different-sequence primer contains (i) a target 
binding segment and (ii) a tag segment having a nucleotide 
sequence that uniquely identifies the target binding segment, 
or (2) one or more polynucleotides in the sample are tagged 
polynucleotides that contain a tag segment having a nucleotide 
sequence that uniquely identifies the attached polynucleotide. 
The tagged moieties can conveniently be immobilized on an ar- 
ray of immobilized tag complements for analysis. 
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WO 00*32824 PCT/US99/28584 

LENGTH DETERMINATION OF NUCLEIC ACID REPEAT SEQUENCES BY 
DISCONTINUOUS PRIMER EXTENSION 



5 FIELD OF THE INVENTION 

This invention relates to methods and kits useful for determining the length of nucleic 
acid repeat sequences. More specifically, this invention relates to methods and kits useful for 
determining the length of nucleic acid repeat sequences by employing a discontinuous primer 
extension reaction. 
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BACKGROUND 

Methods for the analysis of genetic polymorphism have found wide utility in basic 
research, clinical diagnostics, forensics, and other areas. One particularly useful method of 
detecting genetic polymorphism is based on variations in the length of repeat sequences, such 
15 methods being variously referred to as short tandem repeat analysis (STR), variable number of 
tandem repeat analysis (VNTR), minisatellite analysis, and microsatellite analysis. 

Detection of length polymorphisms in nucleic acid repeat sequences has up to now 
relied on gel electrophoresis for the determination of the length of the repeat sequence. 

20 However, gel electrophoresis has several important drawbacks in the context of repeat 
sequence length polymorphism analysis. First, molecular length measurements based on 
electrophoretic mobility are inherently imprecise due to a complicated relationship between 
molecular size and electrophoretic mobility. Second, the degree to which the electrophoretic 
process can be multiplexed is limited by the number of electrophoresis lanes and by the size of 

25 different loci run in a single lane, i.e., loci must be selected which do not electrophoretically 
co-migrate. 



SUMMARY 

The method of the present invention comprises a discontinuous primer extension 

30 reaction wherein a primer is extended in discrete increments such that in each increment of 

primer extension the primer is extended by an amount corresponding to a single repeat unit. 

Following each increment of discrete primer extension, a detection step is performed in which 

a modulation in a signal is detected when the primer has been extended by an amount equal to 

the total length of a repeat region. Thus, by counting the number of increments of discrete 
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primer extension required to cause a modulation in the signal, the number of repeat units 
making up the repeat region is determined. 

It is an object of the present invention to provide a precise and reproducible method for 
5 determining the number of repeat units making up a repeat region of a nucleic acid repeat 
sequence. 

It is another object of the present invention to provide a method for determining the 
number of repeat units making up a repeat region of a nucleic acid repeat sequence which can 
1 0 perform a large number of measurements in parallel. 

It is yet an additional object of the present invention to provide a method for 
determining the number of repeat units making up a repeat region of a nucleic acid repeat 
sequence which does not require an electrophoretic separation. 

15 

It is an object of the present invention to provide kits and reagents useful for practicing 
a method for determining the number of repeat units making up a repeat region of a nucleic 
acid repeat sequence having the above described characteristics. 

20 In a first aspect, the foregoing and other objects of the invention are achieved by a 

method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising annealing a primer-complementary portion of a target nucleic acid to a primer 
thereby forming a target-primer hybrid; performing a first primer extension reaction using a 
first primer extension reagent; separating the target-primer hybrid and unreacted first primer 

25 extension reagent; performing a second primer extension reaction using a second primer 
extension reagent, wherein at least one of the first or second primer extension reagents includes 
an extendible nucleotide having a label attached thereto; separating the target-primer hybrid 
from unreacted second primer extension reagent; measuring a signal produced by the label; 
treating the label so as to render the label undetectable; and repeating the above steps until the 

30 signal is substantially less than a signal detected in a previous cycle. 

In one preferred embodiment of the first aspect of the invention, the step of performing 
a second primer extension reaction further includes reacting the target-primer hybrid with a 
primer termination reagent. 

4 
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In yet another preferred embodiment of the first aspect of the invention, the label is a 
fluorescent or chemiluminescent molecule. 

5 In another preferred embodiment of the first aspect of the invention, the label is 

attached to the extendible nucleotide through a cleavable linker. 

In an additional preferred embodiment of the first aspect of the invention, the target 
nucleic acid is amplified prior to analysis. Preferably such amplification is achieved using a 
10 PCR. 

In another preferred embodiment of the first aspect of the invention, the step of treating 
the label so as to render the label undetectable includes either cleaving the label from the 
labeled extendible nucleotide or destroying a signal producing property of the label. 

15 

In another preferred embodiment of the first aspect of the invention, the target-primer 
hybrid is attached to a solid support. Preferably, one of the primer or the target nucleic acid is 
attached to the solid support. 

20 In a novel, preferred embodiment employing a solid support, the invention includes a 

method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for the primers to anneal to primer-complementary regions in one or 

25 more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

30 (B) performing a first primer extension reaction on the hybrid(s) using a first primer 

extension reagent; 

(C) separating the target-primer hybrid(s) and unreacted first primer extension reagent; 
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(D) performing a second primer extension reaction on the hybrid(s) using a second 
primer extension reagent, wherein at least one of the first or second primer extension reagents 
includes an extendible nucleotide having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
5 reagent; 

(F) measuring a signal produced by the label; 

(G) treating the label so as to render the label undetectable; and 

(H) repeating a cycle of steps (A) through (G) until the signal detected in the target- 
primer hybrid(s) is substantially less than a signal detected in a previous cycle, 

10 wherein (I) prior to step (F), at least an aliquot of either (1) the different-sequence 

primers or (2) the tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of the tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

15 

In one embodiment, the contacting in step (I) is performed prior to step (A). In another 
embodiment, the contacting in step (I) is performed after step (A), and/or before any one of 
steps (B), (C), (D), (E), and (F). In yet another embodiment, steps (A) through (H) are 
performed on at least two replicate arrays, and one of the replicate arrays is subjected to at least 
20 one more cycle of steps (A) through (G) than is a second replicate array. 

In a second aspect, the foregoing and other objects of the invention are achieved by a 
method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising annealing a primer-complementary portion of a target nucleic acid to a primer 

25 thereby forming a target-primer hybrid; performing a first primer extension reaction using a 
first primer-extension reagent; separating the target-primer hybrid from unreacted first primer 
extension reagent; performing a second primer extension reaction using a second primer 
extension reagent and with a primer termination reagent, the primer termination reagent 
including a nucleotide terminator having a label attached thereto; separating the target-primer 

30 hybrid from unreacted second primer extension reagent and unreacted primer termination 
reagent; measuring a signal produced by the label; and repeating the above steps until a signal 
is detected indicating incorporation of the labeled nucleotide terminator into the primer 
extension product. 
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In yet another preferred embodiment of the second aspect of the invention, the label is 
selected from the group consisting of fluorescent and chemiluminescent molecules. 

In an additional preferred embodiment of the second aspect of the invention, the target 
5 nucleic acid is amplified prior to analysis. Preferably such amplification is achieved using a 
PGR. 

In a preferred embodiment of the second aspect of the invention, the target-primer 
hybrid is attached to a solid support. Preferably, one of the primer or the target nucleic acid is 
10 attached to the solid support. 

In a novel, preferred embodiment,the invention includes a method for determining the 
number of repeat units in a repeat region of a target nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
1 5 under conditions effective for the primers to anneal to primer-complementary regions in one or 

more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different- sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
20 having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on the hybrid(s) using a first primer- 
extension reagent; 

(C) separating the target-primer hybrid(s) from unreacted first primer extension 
reagent; 

25 (D) performing a second primer extension reaction on the hybrid(s) using a second 

primer extension reagent and with a primer termination reagent, the primer termination reagent 

including a nucleotide terminator having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 

reagent and unreacted primer termination reagent; 

30 (F) measuring a signal produced by the label; and 

(G) repeating a cycle of steps (A) through (F) until a signal is detected indicating 

incorporation of the nucleotide terminator, 

wherein (H) prior to step (F), at least an aliquot of either (1) the different-sequence 

primers or (2) the tagged sample polynucleotides, are contacted with an addressable array of 
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immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of the tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

5 In one embodiment, the contacting in step (H) is performed prior to step (A). In another 

embodiment, the contacting in step (H) is performed after step (A), and/or before any one of 
steps (B), (C), (D), (E), and (F). In yet another embodiment, steps (A) through (G) are 
performed on at least two replicate arrays, and one of the replicate arrays is subjected to at least 
one more cycle of steps (A) through (F) than is a second replicate array. 

10 

In a third aspect, the foregoing and other objects of the invention are achieved by a kit 
useful for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising a primer having a sequence complementary to a primer-complementary portion of 
a target nucleic acid; a first primer extension reagent; and a second primer extension reagent, 
15 wherein at least one of the first or second primer extension reagents includes an extendible 
nucleotide having a label attached thereto. 

In a preferred embodiment of the third aspect of the invention, the primer is attached to 
a solid support. 

20 

In an additional preferred embodiment of the second aspect of the invention, the label is 
selected from the group consisting of fluorescent and chemiluminescent molecules. 

In another preferred embodiment of the second aspect of the invention, the label is 
25 attached to the extendible nucleotide through a cleavable linker. 

These and other objects, features, and advantages of the present invention will become 
better understood with reference to the following description, drawings, and appended claims. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a schematic depiction of a target nucleic acid. 
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FIGS. 2A-C show a first aspect of the method of the invention wherein an extendible 
nucleotide is labeled and the label is rendered undetectable subsequent to each discrete 
increment of primer extension. 

5 FIGS. 3A-B show a second aspect of the method of the invention wherein a nucleotide 

terminator is labeled. 

FIGS 4 and 5 show exemplary schemes for practicing the invention using a solid phase 
support containing an array of tag complements for obtaining sequence repeat information for 
10 a plurality of different samples in parallel, using tagged primers or tagged sample 
polynucleotides. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Reference will now be made in detail to the preferred embodiments of the invention, 
15 examples of which are illustrated in the accompanying drawings. While the invention will be 
described in conjunction with the preferred embodiments, it will be understood that they are 
not intended to limit the invention to those embodiments. On the contrary, the invention is 
intended to cover alternatives, modifications, and equivalents, which may be included within 
the invention as defined by the appended claims. 

20 

I. DEFINITIONS 

Unless stated otherwise, the following terms and phrases as used herein are intended to 
have the following meanings: 

25 "Nucleoside" refers to a compound consisting of a purine, deazapurine, or pyrimidine 

nucleoside base, e.g., adenine, guanine, cytosine, uracil, thymine, deazaadenine, 
deazaguanosine, and the like, linked to a pentose at the V position, including 2'-deoxy and 2- 
hydroxyl forms (Stryer). 

30 The term "nucleotide" as used herein refers to a phosphate ester of a nucleoside, e.g., a 

triphosphate ester, wherein the most common site of esterification is the hydroxy! group 

attached at the C-5 position of the pentose. Many times in the present disclosure the term 

nucleoside will be intended to include both nucleosides and nucleotides. The terms nucleotide 

and nucleoside as used herein are intended to include synthetic analogs having modified 

9 
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nucleoside base moieties, modified sugar moieties, and/or modified phosphate ester moieties, 
e.g., as described elsewhere (Scheit; Eckstein). 

"Polynucleotide" or "oligonucleotide" refer to linear polymers of nucleotide monomers, 
including single, double and triple stranded deoxyribonucleotides, ribonucleotides, a-anomeric 
forms thereof, and the like. Usually the nucleoside monomers are linked by phosphodiester 
linkages, where as used herein, the term "phosphodiester linkage" refers to phosphodiester 
bonds or bonds including phosphate analogs thereof wherein the phosphorous atom is in the 
+5 oxidation state and one or more of the oxygen atoms is replaced with a non-oxygen moiety. 
Exemplary phosphate analogs include phosphorothioate, phosphorodithioate, 
phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, 
phosphoramidate, boronophosphates, and the like, including associated counterions, e.g., H + , 
NH 4 + , Na\ and the like if such counterions are present. Alternatively, polynucleotides may 
comprise polymers of non-nucleotidic monomers, linked through phosphodiester linkages or 
other linkages, which are capable of forming sequence-specific hybrids with a target nucleic 
acid, e.g., peptide nucleic acid polymers (PNAs, e.g., see Knudsen, 1996). Polynucleotides 
typically range in size from a few monomelic units, e.g. 8-40, to several thousands of 
monomeric units. Whenever a polynucleotide is represented by a sequence of letters, such as 
"ATGCCTG," it will be understood that the nucleotides are in 5'->3' order from left to right 
and that M A" denotes deoxyadenosine, "C" denotes deoxycytidine, "G" denotes 
deoxyguanosine, and "T" denotes thymidine, unless otherwise noted. 

"Extendible nucleotide" means any nucleotide that when incorporated into a primer 
extension product during a primer extension reaction allows for the further extension of such 
primer extension product. Exemplary extendible nucleotides include 2'-deoxynucleotide 
triphosphates, e.g., 2'-deoxyuridine-5'-triphosphate, 2 , -deoxyguanosine-5 , -triphosphate, 2'- 
deoxy-7-deazadeoxyguanosine-5'-triphosphate, 2 , -deoxyadenosine-5 '-triphosphate, T- 
deoxythymidine-S'-triphosphate, and 2 '-deoxycytidine-5' -triphosphate. Optionally, one or 
more of the extendible nucleotides includes a label. 

"Nucleotide terminator" means any nucleotide that when incorporated into a primer 
extension product prevents the further extension of such primer extension product. One 
requirement of a nucleotide terminator is that when the nucleotide terminator includes a 

10 
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ribofuranose sugar portion, the 3'~position must not have a hydroxy group capable of being 
subsequently used by a polymerase to incorporate additional nucleotides. Alternatively, a 
ribofuranose analog could be used, such as arabinose. Exemplary nucleotide terminators 
include 2\3 , -dideoxy-p-D-ribofuranosyl, P-D-arabinofuranosyl, 3'-deoxy-p-D- 
5 arabinofuranosyh 3'-amino-2',3 , -dideoxy-P-D-ribofuranosyl, and 2\3'-dideoxy-3 , -fluoro-p- 
D-ribofuranosyl (Chidgeavadze, 1984, 1985). Nucleotide terminators also include reversible 
nucleotide terminators (Metzker), and 3'-deoxy substituents such as hydrogen, 3'-fluoro, 3'- 
amino, and 3'-azido, for example (Mikhailopulo et al., 1989; Krayevski et al., 1984; 
Chidgeavadze, 1986). 

10 

"Polymerase" means an enzyme or other catalyst capable of catalyzing a reaction 
leading to a target-sequence dependent incorporation of a nucleotide onto a 3'-end of a primer 
or primer extension product when such primer or primer extension product is annealed to a 
target nucleic acid. Exemplary polymerases include but are not limited to Pfu DNA 
15 polymerase, E.Coli Polymerase I, T-7 polymerase, reverse transcriptase, Taq DNA 
polymerase, and the like (Kornberg and Baker). 

"Label" means any moiety that, when attached to a nucleotide or polynucleotide of the 
invention, render such nucleotide or polynucleotide detectable using known detection means. 

20 Labels may be direct labels which themselves are detectable or indirect labels which are 
detectable in combination with other agents. Exemplary direct labels include but are not 
limited to fluorophores, chromophores, radioisotopes, spin-labels, chemiluminescent labels, 
and the like. Exemplary indirect labels include enzymes which catalyze a signal-producing 
event, and ligands such as an antigen or biotin which can bind specifically with high affinity to 

25 a detectable anti-ligand, such as a labeled antibody or avidin. 

"Primer extension reaction" means a reaction between a target-primer hybrid and a 
nucleotide which results in the addition of the nucleotide to an end of the primer, usually the 
3'-end, such that the added nucleotide is complementary to the corresponding nucleotide of the 
30 target nucleic acid. 
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"Pnmcr-extension reagent" means a reagent including components necessary to effect a 
primer extension reaction. Primer extension reagents typically include (i) a polymerase 
enzyme; (ii) a buffer; and (iii) one or more extendible nucleotides. 

"Specific binding pair" refers to a pair of molecules that specifically bind to one 
another to form a binding complex. Examples of specific binding pairs include, but are not 
limited to antibody-antigen (or hapten) pairs, ligand-receptor pairs, enzyme-substrate pairs, 
biotin-avidin pairs, polynucleotides having complementary base pairs, and the like. 

"Primer is a polynucleotide capable of selectively annealing to a specified target 
sequence and thereafter serve as a point of initiation of a primer extension reaction wherein the 
primer is extended in the 3 5' a 5'-* 3' direction, typically the latter. 

II. MATERIALS USED IN THE MRTHQD OF THF. INVENTION 

A. Target Nucleic Acid 

The target nucleic acids for use with the invention may be derived from any living or 
once living organisms, including but not limited to prokaryotes, eukaryotes, plants, animals, 
and viruses, as well as synthetic nucleic acids. The target nucleic acids may originate from any 
of a wide variety of sample types, such as cell nuclei (e.g., genomic DNA) and extranuclear 
nucleic acids, e.g., plasmids, mitrochondrial nucleic acids, and the like. The target nucleic 
acids can include DNA or RNA, and are usually DNA. 

Many methods are available for the isolation and purification of a target nucleic acid 
for use in the present invention. The preferred purification method should provide target 
nucleic acid sufficiently free of protein to allow efficient primer extension and nucleic acid 
amplification. Preferred purification methods include (i) organic extraction followed by 
ethanol precipitation, e.g., using a phenol/chloroform organic reagent (Ausubel), preferably 
using an automated DNA extractor, e.g., the Model 341 DNA Extractor available from PE 
Applied Biosystems (Foster City, CA); (ii) solid phase adsorption methods (Walsh, Boom); 
and (iii) salt-induced DNA precipitation methods (Miller), such methods being typically 
referred to as "salting-out" methods. Optimally, each of the above purification methods is 
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preceded by an enzyme digestion step to help eliminate protein from the sample, e.g., digestion 
with proteinase K, or other like proteases. 

To increase sensitivity, preferably the target nucleic acid is amplified prior to 
5 performing the method using a suitable nucleic acid amplification procedure. Such 
amplification may be linear or exponential. In a preferred embodiment, amplification of the 
target nucleic acid is accomplished using the polymerase chain reaction (PCR) (Mullis). 
Generally, the PCR consists of an initial denaturation step which separates the strands of a 
double stranded nucleic acid sample, followed by the repetition of (i) an annealing step which 

10 allows amplification primers to anneal specifically to positions flanking a target sequence; (ii) 
an extension step which extends the primers in a 5'-> 3* direction thereby forming an 
amplicon nucleic acid complementary to the target sequence, and (iii) a denaturation step 
which causes the separation of the amplicon and the target sequence. Each of the above steps 
may be conducted at a different temperature, where the temperature changes may be 

15 accomplished using a thermocycler (PE Applied Biosystems, Foster City, CA). 

The generalized structure of a target nucleic acid for use in the present invention is 
shown in FIG. 1 where the target nucleic acid 5 includes a 5'-flanking portion 10 including a 
primer complementary portion 15, a 3'-flanking portion 25, and a repeat region 20 located 

20 between the S'-flanking portion and the and the 3 , -flanking portion. The repeat region 20 of 
the target nucleic acid comprises multiple repeat units (R) n 21 where R indicates a repeat unit 
and n designates the number of repeat units making up the repeat region. The repeat unit R 

_ may be any type of repeat motif, for example, but not limited to a microsatellite repeat 
(Webber and May; Smeets; Williamson), a minisatellite repeat (Jeffreys, Caskey), or an a- 

25 satellite repeat (Jabs). 

The repeat region may be made up of multiple types of repeat units or repeat units 
which are themselves polymorphic. 

30 B. Primer 

Primers for use in the present invention are designed to obtain a balance between 
specificity of primer annealing, i.e., the frequency with which an undesired target sequence 
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participates in a primer extension reaction, and efficiency of primer extension, i.e., the extent 
to which a desired target sequence participates in the primer extension reaction. 

Specificity of primer annealing is generally controlled by the length of the primer and 
5 the temperature of the annealing reaction. Polynucleotides between about 18 and 24 bases are 
preferred because such polynucleotides tend to be very sequence specific when the annealing 
temperature is set within a few degrees of a primer melting temperature (Dieffenbach). To 
facilitate primer extension, a 3'-end of the primer includes an — OH group or other moiety 
which allows incorporation of a nucleotide onto the 3 '-end of the primer. There exist a number 
10 of computer programs to facilitate primer selection in different contexts (Osborne; Montpetit). 

In a preferred embodiment, the sequence of the primer is selected such that the primer 
anneals to the primer complementary portion of the 5 '-flanking portion of the target nucleic 
acid. Preferably, the primer anneals such that a 3'-end of the primer is adjacent to a 5'-end of a 
15 repeat region of a target nucleic acid. However, the primer may also anneal to a segment of 
the repeat region of the target nucleic acid so long as it is at least partially anchored to the 5'- 
flanking portion of the target. 

For embodiments of the invention which employ sample identifier tags and arrays of 
20 tag complements, the invention utilizes a plurality of extendable, different-sequence primers 
for detecting target sequences of interests. In one embodiment, the tagged primer includes a 
target binding segment, a tag segment, and an extendable primer end (5' or 3'). The target 
binding segment includes a polynucleotide sequence which is selected to bind to a selected 
target sequence. The tag segment contains a unique polynucleotide sequence that allows 
25 identification of the target binding segment to which the tag segment is attached. The tag 
segment can be directly attached to the distal end of the target binding segment, or is optionally 
linked to the tag segment by an intervening spacer group. In another embodiment, the tag 
segment is linked to an internal site within the target binding segment. Thus, the tag can be 
linked to an intersubunit linking group, or to a nucleotide base, within the target binding 
30 segment. Preferably, the tag is attached to an end of the target binding segment that is distal 
with respect to the extendable end of the primer. 

The sequence of each target binding segment is selected to hybridize to a selected 

complementary target which contains a potential polymorphism or mutation, preferably such 
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that the 3'-end of the primer is adjacent to a 5^-end of a repeat region of a target nucleic acid 
(for extension in the 5' to 3' direction). However, the primer may also anneal to a segment of 
the repeat region of the target nucleic acid so long as it is at least partially anchored to the 5'- 
flanking portion of the target. 

The length of the target binding segment in each tagged primer is selected to ensure 
specific hybridization of the primer to the desired target, without significant cross- 
hybridization to non-target nucleic acids in the sample. Also, to enhance primer specificity, it 
is preferred that the melting temperatures of the target binding segments are within a few 
degrees of each other. Preferably, the melting temperatures of the target binding segments fall 
within a ATm range (Tmax - Tmin) of 10°C or less, and preferably 5°C or less. This can be 
accomplished by suitable choice of binding segment lengths based on known methods for 
predicting primer melting temperatures (Breslauer, 1986; Rychlik, 1989 and 1990; Wetmur, 
1991; Osborne. 1991; Montpetit, 1992) for example. As above, target binding segments 
between about 1 8 and 24 bases in length are preferred. 

The tag segment in each tagged primer is designed to contain a sequence that uniquely 
identifies the attached target binding segment. Thus, the tag sequences should be selected to 
minimize (1) internal, self-hybridization, (2) hybridization with other same-sequence tags, (3) 
hybridization with other, different sequence tag complements. (4) and hybridization with the 
sample polynucleotides. Also, it is preferred that each tag can specifically recognize and 
hybridize to its corresponding tag complement under the same conditions for all tags in the 
primers. 

Tag sequences can be selected by any suitable method. For example, computer 
algorithms for selected non-crosshybridizing sets of tags are described in Brenner (1996) and 
Shoemaker (1997). Preferably, the tag sequences have strands that are within a preselected 
temperature range, as discussed above with respect to the extendable primers. Preferably, the 
melting temperatures of the target binding segment's fall within a ATm range (Tmax - Tmin) of 
10°C or less, and preferably within 5°C or less, as calculated using any of the methods above 
(e.g., Breslauer). Preferably, the tag segments are at least 12 bases in length to facilitate 
specific hybridization to corresponding tag complements. Typically, tag segments are from 12 
to 60 bases in length, and typically from 15 to 30 bases in length. 
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Tags and tag complements may be single or double stranded, such that sequence 
specific hybridization forms either duplexes by Watson and Crick base-pairing, or triplexes by 
forward or reverse Hoogsteen bonding. In embodiments where specific hybridization occurs 
via triplex formation, coding of tag sequences follows the same principles as for duplex- 
5 forming tags; however, there are further constraints on the selection of word sequences. 
Generally, third strand association via Hoogsteen type of binding is most stable along 
homopyrimidine-homopurine tracks in a double stranded target. Usually, base triplets form in 
T-A*T or C-G*C motifs (where "-" indicates Watson-Crick pairing and "*" indicates 
Hoogsteen type of binding); however, other motifs are also possible. For example, Hoogsteen 
10 base pairing permits parallel and antiparallel orientations between the third strand (the 
Hoogsteen strand) and the purine-rich strand of the duplex to which the third strand binds, 
depending on conditions and the composition of the strands. 

There is extensive guidance in the literature for selecting appropriate sequences, 
15 orientation, conditions, nucleoside type (e.g. whether ribose or deoxyribose nucleosides are 
employed), base modifications (e.g. methylated cytosine, and the like in order to maximize, or 
otherwise regulate, triplex stability as desired in particular embodiments, e.g., Brenner (supra). 
More generally, conditions for annealing single-stranded or duplex tags to single-stranded or 
duplex sequence complements are well known, e.g. Brenner (supra), Ji et al. (1993), Cantor et 
20 al. (supra), Wetmur (1991), Breslauer et al. (1986), Schena (1995), and the like. 

Preferably, polynucleotides such as primers are synthesized conventionally on an 
automated DNA synthesizer, e.g., PE Applied Biosystems (Foster City, CA) model 392 or 394 
DNA/RNA Synthesizer, using standard chemistries, e.g., phosphoramidite chemistry 
25 (Beaucage). In an alternative method, primers can be isolated from a biological source. 

C. Solid Phase Supports 

In a preferred embodiment of the method of the invention, a target-primer hybrid is 
attached to a solid phase support during a separating step. Such attachment may be through 
30 either the target nucleic acid or the primer polynucleotide. 

Solid phase supports for use with the invention may have a wide variety of forms, 

including microparticles, beads, membranes, slides, plates, micromachined chips, and the like. 

In addition, solid phase supports of the invention may comprise a wide variety of 
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compositions, including glass, plastic, silicon, alkanethiolate-derivatized gold, GaAs, copper, 
germanium, cellulose, low cross-linked and high cross-linked polystyrene, crosslinked 
polyacrylamide matrices, silica gel, polyamide, membranes such as nylon, polyvinylidine 
difluoride (PVDF), or poly-tetrafluoroethylene, and the like. 

5 

Where attachment of the target-primer hybrid is through the primer, primers may be 
used with a solid phase support on which they were synthesized, or they may be separately 
synthesized and attached to a solid phase support for use during or before the separation step of 
the method. 

10 

When primers are synthesized on and used with the same solid phase support, such 
support may comprise a variety of forms and include a variety of linking moieties. Such 
supports may comprise microparticles or planar arrays, or matrices of regions having 
substantially uniform populations of primers, A wide variety of microparticle synthesis 

15 supports may be used with the invention, including microparticles made of controlled pore 
glass (CPG), highly cross-linked polystyrene, acrylic copolymers, cellulose, nylon, dextran, 
latex, polyacrolein, and the like. Microparticle supports further include commercially available 
nucleoside-derivatized CPG and polystyrene beads (e.g. available from Applied Biosystems, 
Foster City, CA); derivatized magnetic beads; polystyrene grafted with polyethylene glycol 

20 (e.g., TentaGel, Rapp Polymere, Tubingen Germany); and the like. Selection of the support 
characteristics, such as material, porosity, size, shape, and the like, and the type of linking 
moiety employed depends on the conditions under which the primers are used. For example, 
in the present invention, supports and linkers that minimize steric hindrance of the polymerase 
enzymes and that facilitate access to nucleotide substrate are preferred. Other important 

25 factors to be considered in selecting the most appropriate microparticle support include size 
uniformity, efficiency as a synthesis support, degree to which surface area known, and optical 
properties, e.g., clear smooth beads provide instrumentational advantages when handling large 
numbers of beads on a surface. 

30 As mentioned above, primers may also be synthesized on a single (or a few) solid phase 

supports to form an array of regions uniformly coated with primers. That is, within each 
region in such an array the same primer is synthesized. Techniques for synthesizing such 
arrays are disclosed elsewhere (Pease; Southern). 
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When primers are separately synthesized, and subsequently attached to a solid phase 
support for use, the primer may be attached to the support through a covalent linkage or a non- 
covalent linkage. When the primer is attached to the solid support through a non-covalent 
linkage, the primer includes one member of specific binding pair, e.g., biotin, the other 
5 member of the pair being attached to the solid support, e.g., avidin. Several methods are 
available for covalently linking polynucleotides to solid supports, e.g., through reaction of a 5*- 
amino polynucleotide with an isothiocyanate-functionalized glass support (Guo). A wide 
range of exemplary linking moieties for attaching primers onto solid supports either covalently 
or non-covalently are disclosed elsewhere. (Pon; Webb; Barany; Damha; Beattie; Maskos and 
10 Southern). 

Where attachment of the primer-template hybrid is through the template nucleic acid, 
and the template nucleic acid is a PGR amplicon, the means for covalent or non-covalent 
attachment may be incorporated into a PCR primer used to effect the PCR. Thus, the PCR 
15 primer may contain a member of a specific binding pair, e.g., biotin, or a reactive moiety 
which can react with a functionalized solid support to form a covalent linkage, e.g., a 5 '-amino 
group which reacts with a an isothiocyanate-functionalized glass support. 

As noted above, the invention also utilizes a set of tag complements which are 
20 complementary to corresponding tag sequences in the tagged primers. The tag complements 
are provided as an addressable array, according to the design choice of the user. By 
"addressable array* 1 is meant that the sequence of the target binding segment of each primer is 
known or can be determined from the position of hybridization of each primer on the array. 
Preferably, the tag complements are immobilized in discrete regions on a planar surface, such 
25 that each discrete region contains only tag complements having a particular sequence, and such 
that the sequence of the tag complement at each different discrete region is known. 
Conveniently, the tag complements are distributed as a periodic two-dimensional array of 
discrete tag complement regions which can be indexed via X and Y coordinates, or any 
equivalent thereof. Tag complements can be attached to appropriate solid phase support 
30 materials following the same considerations as for attachment of primers, discussed above. 

To reduce the amounts of assay reagents used for tag detection, and to facilitate the 

sequencing of large numbers of fragment sequences, the arrays are preferably formed as 

microarrays having tag complement region densities of greater than 100 regions/cm 2 , 300 
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regions/cm 2 , 10" regions/cm 2 , 3 x 10 3 regions/cm 2 , 10 4 regions/cm 2 , 10' regions/cm 2 , or 10 6 
regions/cm 2 . In addition, the number of different sequence tag complements in each array is 
preferably equal to or greater than 10, 20, 50, 100, 200, 500, 1000, 3000, 10,000, 30,000, 
100,000, or 300,000. 

5 

D. Labeled Nucleotides 

In the methods of the present invention, one or more extendible nucleotides and/or 
nucleotide terminators include a label. The label is attached to the nucleotide in such a way 
that the label does not substantially interfere with polymerase-mediated incorporation of the 

10 labeled nucleotide in a primer extension reaction. Many alternative methods are available for 
labeling nucleotides in a manner suitable for use with the present invention (Kricka). In a 
preferred class of labeling methods, a nucleoside base of the nucleotide is modified to include 
a label, i.e., the N-6 position of a purine base or the C-5 position of a pyrimidine base. A 
particularly preferred class of labeled nucleotides are the propargylethoxyamino nucleotides 

15 (Khan). 

In one preferred embodiment of the invention, a labeled extendible nucleotide is 
capable of being rendered undetectable, e.g., upon treatment with a suitable reagent or 
electromagnetic radiation. In this embodiment, the labeled extendible nucleotide may be 
20 rendered undetectable by either removing the label from the nucleotide or by destroying the 
signal-producing properties of the label. 

Several methods are available for attaching a label to an extendible nucleotide such that 
the label may be easily removed. Exemplary cleavable linkers for linking a label to a 
25 nucleotide include but are not limited to (N-[4-(p-azidosalicylamido)butyl]-3'-[2 , - 
pyridyldithio]propionamide (APDP), (bis[2-(succinimidooxycarbonyloxyl)ethyl]sulfone 
(BSOCOES), disuccininimdyl tartarate (DST), and ethylene glycobis-fsuccinimidylsuccinate] 
(EGS), and the like (Pierce Catalog). 

30 Preferred labels whose signal producing properties may be destroyed upon treatment 

with a suitable reagent or electromagnetic radiation include fluorescent labels whose 

fluorescent properties may be destroyed by photodestruction through exposure to high intensity 

light or by chemical degradation through treatment with oxidizing chemicals, e.g., oxygen, 

sodium hypochlorite, permanginate and the like. Alternative preferred classes of labels 
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include enzymes, which may be rendered undetectable by reaction with an irreversable 
enzyme inhibitor or by denaturation, and chemi luminescent labels which can undergo only a 
single light-producing transformation and are thus autodestructing. 

5 E. First and Second Primer-Extension Reagents 

The present invention includes first and second primer extension reagents that, when 
used according to methods of the invention, enable the extension of a primer to proceed in 
discrete increments of single repeat units, i.e., the primer is extended only by one repeat unit 
per discrete primer extension reaction cycle. 

10 

The first primer extension reagent of the invention includes a set of extendible 
nucleotides which allow a primer extension reaction to proceed only to the extent that a primer 
is extended by an amount less than a full repeat unit. Thus, depending on the particular 
sequence of the repeat unit, the first primer extension reagent may include a variety of possible 
15 extendible nucleotide combinations. For example, if the sequence of the repeat unit is AGCT, 
the first primer extension reagent could include extendible nucleotides T (complementary to 
A), T and C (complementary to A and G), or T and C and G (complementary to A and G and 
C). However, to prevent uncontrolled continuous primer extension, the first primer extension 
reagent should not contain extendible nucleotides T, C, G and A together. 

20 

In certain situations, it is desirable to sub-divide the first primer extension reagent into 
separate sub-reagents such that each sub-reagent includes extendible nucleotides sufficient to 
allow extension of a primer only to the extent that a sub-portion of the repeat sequence is 
formed. For example, if the repeat unit is ATGCCGT, one sub reagent of the first primer 
25 extension reagent could include extendible nucleotides T, A, and C, while another sub-reagent 
of the first primer extension reagent could include extendible nucleotides G and C. 

The second primer extension reagent of the invention includes a set of extendible 
nucleotides which allow a primer extension reaction to proceed only to the extent that the 
30 portion of a repeat unit not synthesized by the first primer extension reagent is synthesized. 
Thus, depending on the particular sequence of the repeat unit and the composition of the first 
primer reagent, the second primer extension reagent may include a variety of possible 
nucleotide combinations. Continuing the example discussed above, if the sequence of the 
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repeat unit is AGCT and the first primer extension reagent includes extendible nucleotides T 
and C the second primer extension reagent may include extendible nucleotides G and A. 

F. Primer Termination Reagent 
5 The present invention includes a primer termination reagent for causing the termination 

of primer extension such that once a primer extension product has reacted with the primer 
termination reagent, no further primer extension may be accomplished. 

The primer termination reagent of the invention includes one or more nucleotide 
10 terminators, and optionally, a set of extendible nucleotides which allow a primer extension 
reaction to proceed only to the extent that a primer is extended by an amount less than a fUll 
repeat unit in a primer extension reaction. Thus, depending on the particular sequence of the 
repeat unit, the sequence of the 3 '-flanking portion of the target nucleic acid, and the 
composition of the first and second primer extension reagents, the primer termination reagent 
15 may include a variety of possible extendible nucleotide and nucleotide terminator 
combinations. For example, if the sequence of the repeat unit is AGCT, and the first nucleotide 
of the 3'-flanking portion is G, and the first and second primer extension reagents include the 
extendible nucleotides T, C, G and A, the primer termination reagent would include only the 
nucleotide terminator C, such nucleotide terminator being complementary to the G located in 
20 the 3 , -flanking portion. Alternatively, if the first and second primer extension reagents only 
include the extendible nucleotides T and C, the primer termination reagent would include 
extendible nucleotides G and A and nucleotide terminator C. 

In certain aspects of the present invention, the primer termination reagent includes a 
25 labeled nucleotide terminator. The labeling of the nucleotide terminator is accomplished 
essentially as described above in Section D. 

III. THE METHOD 

A preferred embodiment of a first aspect of the method of the invention is 
30 schematically depicted in FIGS. 2A-C. In the figure, the method is applied to a target nucleic 
acid 5 having a repeat region 20 made up of two copies of a two-base repeat having the 
sequence "AC" and a 3'-flanking portion 25 having a G nucleotide abutting the repeat region. 
In this preferred embodiment of the first aspect, a primer 200 is annealed to a primer- 
complementary portion 15 of the target nucleic acid 5 thereby forming a target-primer hybrid. 
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The target-primer hybrid is then reacted with a first primer-extension reagent including a 
labeled extendible nucleotide T, resulting in the incorporation of the labeled T nucleotide into 
the 3'-end of a primer extension product 210. Following reaction with the first primer 
extension reagent, the first primer extension reagent is separated from the target-primer hybrid 
5 and the target-primer hybrid is reacted with a second primer-extension reagent including an 
extendible G nucleotide, resulting in the addition of the G nucleotide into a 3'-end of the 
primer extension product 215. Next the unreacted second primer extension reagent is 
separated from the target-primer hybrid and a measurement is performed to determine the 
amount of labeled extendible nucleotide incorporated into the primer extension product. As 

10 indicated by the histogram in the figure, at this point in the process, a large signal is detected, 
indicating the presence of the incorporated labeled T nucleotide. Finally, in order to prepare 
the target-primer hybrid for a subsequent discrete primer extension reaction cycle, the label 
attached to the incorporated extendible nucleotide is rendered undetectable. In the example the 
above described process steps are repeated two more times as shown in FIGS. 2 B and 2C. In 

15 the third cycle shown in FIG. 2C, the intensity of the measured signal is substantially reduced 
as compared to the signal intensities seen in the first two cycles because the labeled extendible 
nucleotide T can not be incorporated into the primer extension product at this point. Thus, this 
reduction in the measured signal seen in the third cycle indicates that the repeat region only 
contains two copies of the AC repeat unit. 

20 

A preferred embodiment of a second aspect of the method of the invention is shown in 
FIGS. 3A-B. As before, the method is applied to a target nucleic acid having a repeat region 
made up of two copies of a two-base repeat having the sequence "AC" and a 3'-flanking 
portion having a G nucleotide abutting the repeat region. In this preferred embodiment of the 

25 second aspect, as before, a primer 200 is annealed to a primer-complementary portion 15 of a 
target nucleic acid 5 thereby forming a target-primer hybrid. The target-primer hybrid is then 
reacted with a first primer-extension reagent including an unlabeled extendible nucleotide T, 
resulting in the incorporation of the unlabeled T nucleotide into the 3 '-end of a primer 
extension product 310. Following reaction with the first primer extension reagent, the first 

30 primer extension reagent is separated from the target-primer hybrid and the target-primer 

hybrid is reacted with a second primer-extension reagent, including an extendible G 

nucleotide, and a primer termination reagent including a labeled C nucleotide terminator, 

resulting in the addition of only the G extendible nucleotide into the 3 '-end of the primer 

extension product 315. Next the unreacted second primer extension reagent and primer 
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termination reagent are separated from the target-primer hybrid and a measurement is 
performed to determine the amount of labeled nucleotide terminator incorporated into the 
primer extension product. As indicated in the figure, at this point in the process, no signal is 
detected, indicating that the labeled nucleotide terminator is not incorporated into the primer 
5 extension product during this cycle of the process. In FIG. 3B. the above described process 
step is repeated. In this second cycle shown in FIG. 3B, the intensity of the measured signal is 
substantially increased as compared to the nominally zero signal intensity seen in the first step 
of the process because during the second step, the labeled nucleotide C terminator is 
incorporated into the primer extension product. 

10 

The following discussion provides a more detailed description of the above-described 
method steps of the first and second aspects of the invention. 

A. Primer Annealing 

15 The annealing reaction is performed under conditions which are stringent enough to 

guarantee sequence specificity yet sufficiently permissive to allow formation of stable hybrids 
at an acceptable rate. The temperature and length of time required for primer annealing depend 
upon several factors including the base composition, length and concentration of the primer, 
and the nature of the solvent used, e.g., the concentration of cosolvents such as DMSO, 

20 formamide. or glycerol, and counter ions such as magnesium. Typically, hybridization with 
synthetic polynucleotides is carried out at a temperature that is approximately 5 to 10°C below 
the melting temperature of the target-primer hybrid in the annealing solvent. Preferably, the 
annealing temperature is in the range of 55 to 75°C and the primer concentration is 
approximately 0.2 jiM. Under these preferred conditions, the annealing reaction will be 

25 complete in only a few seconds. 

B. Primer Extension Reaction 

The time required to effect a primer extension reaction depends upon the length and 
concentration of the target sequence and upon the reaction temperature. Estimates for the rate 
30 of nucleotide addition under typical conditions vary from 35 to 100 nucleotides per second 
depending upon the buffer, pH, salt concentration, polymerase enzyme, and the nature of the 
DNA template. 
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In order to achieve a primer extension reaction which proceeds in discrete increments 
of single repeat units, according to the method of the invention, the primer extension reaction 
is divided into two independent steps: a first primer extension reaction and a second primer 
extension reaction. In the first primer extension reaction, the primer is extend by an amount 
5 less than the length of a single repeat unit, where control of the extent of primer extension is 
effected by the composition of a first primer extension reagent. In the second primer extension 
reaction, the primer is extended by an amount such that, in combination with the first primer 
extension reaction, the primer is extended by an amount equal to the length of a single repeat 
unit. 

10 

According to the first aspect of the method of the invention, one of the first or second 
primer extension reagents includes a labeled extendible nucleotide. 

Also according to the first aspect of the method of the invention, in a variant of the 
15 above-described two-step discrete primer extension reaction, the second primer extension 
reagent includes a primer termination reagent. Thus, if after a second primer extension 
reaction the primer has been extended to the end of the repeat region of the target nucleic acid, 
a nucleotide terminator will be incorporated into the primer extension product thus prohibiting 
any further extension of that primer extension product. This is advantageous because it will 
20 remove the possibility that any spurious primer extension beyond the repeat region of the target 
nucleic acid will take place. This embodiment of the invention is particularly preferred where 
two alleles of a repeat sequence are being investigated simultaneously. 

According to the second aspect of the invention, a primer termination reagent including 
25 a labeled nucleotide terminator is included in the second primer extension reaction. Preferably, 
the labeled nucleotide terminator is selected to be complementary to the nucleotide at the 3'- 
end of the 3'-flanking portion of the target nucleic acid which abuts the repeat region of such 
target nucleic acid. 

30 C. Separation of Primer and Primer Extension Reagents 

Between the first and second primer extension reactions, a separation step is performed 

to prevent the mixing of first and second primer extension reagents and thereby prevent 

uncontrolled primer extension. The means used to effect the separation step may be any means 

capable of separating the target-primer hybrid from the first and/or second primer extension 
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reagents. Exemplary separation methods include but are not limited to HPLC, electrophoresis, 
liquid-liquid extraction, solid-liquid extraction, adsorption, and the like. 

In a preferred embodiment, the target-primer hybrid is attached to a solid support 
5 during the separation step such that the primer extension reagents may be separated from the 
target-primer hybrid simply by washing the solid support. According to this embodiment, the 
primer may be attached to the solid support before or after performing the first primer 
extension reaction. The washing conditions are such that the target-primer hybrid is not 
substantially disrupted and nonspecific adsorption of the primer extension reagents is 
10 minimized. 

D. Measuring a Signal 

Subsequent to either the first or second primer extension reaction, a detection step is 
performed wherein the amount of intact label which has been incorporated into a primer 
15 extension product is determined. Any detection method may be used which is suitable to the 
type of label employed. Thus, possible detection methods include radioactive detection, 
optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, 
e.g., fluorescence or chemi luminescence. 

20 The measuring step can take place at various points in the process depending upon the 

aspect of the invention being practiced and the composition of the first and second primer 
extension reagents. In the first aspect, preferably the measuring step takes place after the 
primer extension reagent including the labeled extendible nucleotide has been reacted with and 
separated from the target-primer hybrid. In the second aspect, the measuring step takes place 

25 after the primer termination reagent including the labeled nucleotide terminator has been 
reacted with and separated from the target-primer hybrid 

If the target-primer hybrid(s) are immobilized on a solid support for analysis, extended 

primers can be detected in an addressable array by scanning all or portions of each array 

30 simultaneously or serially, depending on the scanning method used. For fluorescence labeling, 

selected regions on an array may be serially scanned one-by-one or row-by-row using a 

fluorescence microscope apparatus, such as described in Fodor (1995) and Mathies et al. 

(1992). Hybridization patterns may also be scanned using a CCD camera (e.g., Model 

TE/CCD512SF, Princeton Instruments, Trenton, NJ) with suitable optics (Ploem, 1993), such 
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as described in Yershov et al. (1996), or may be imaged by TV monitoring (Khrapko, 1991). 
For radioactive signals (e.g., 32 P), a phosphorimager device can be used (Johnston et al., 1990; 
Drmanac et aL 1992; 1993). Other commercial suppliers of imaging instruments include 
General Scanning Inc., (Watertown, MA, www.genscan.com), Genix Technologies (Waterloo, 
5 Ontario, Canada; www.confocal.com), and Applied Precision Inc. Such detection methods are 
particularly useful to achieve simultaneous scanning of multiple tag complement regions. 

E. Rendering A Label Undetectable 
10 According to the first aspect of the method of the invention, once a signal from a label 

is detected, prior to performing a subsequent cycle of discrete primer extension, the label is 
rendered undetectable. 

In one preferred embodiment, the label is rendered undetectable by cleaving the label 
15 off of the labeled extendible nucleotide incorporated in the primer extension product. The 
method used for cleaving the label from the nucleotide depends on the type of cleavable 
linkage used to link the label and the nucleotide. See above. Exemplary cleavage reagents 
include thiol, base, periodate, hydroxylamine and the like. In one preferred method, the label 
is attached to a base-portion of a uracil nucleotide and subsequent to detection the label is 
20 cleaved off of the labeled extendible nucleotide by treatment with the enzyme uracil N- 
glycosylase (UNG). 

In a second preferred embodiment, the label is rendered undetectable by destroying the 
signal-producing properties of the label itself. Depending on the type of label used, there are 

25 several methods which may be employed for destroying the signal-producing properties of the 
label. For example, if the label is a fluorescent dye, the label may be rendered undetectable by 
photobleaching the dye using an intense light source in an oxygen-rich environment. If the 
label is an enzyme, the label may be rendered undetectable by reacting the label with an 
irreversible enzyme inhibitor which renders the enzyme incapable of catalyzing a signal 

30 producing reaction. If the label is a chemiluminescent agent which can undergo only a single 
light-producing transformation, the label is autodistructing and thus does not require a separate 
reaction to render the label undetectable (Bronstein). 
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F. Tatz Complement Arrays 

As noted above, the invention also includes embodiments that utilize primers or sample 
polynucleotides that contain tag sequences that uniquely identify the attached primers or 
sample polynucleotides, which can be immobilized on addressable arrays for analysis of 
5 multiple sample polynucleotides in parallel. 

A plurality of different-sequence primers are contacted with a polynucleotide sample 
under conditions effective for the primers to anneal to primer-complementary regions in one or 
more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 

10 each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide. One or 
more primer extension cycles are performed as described above, and the appearance or loss of 

15 signal is determined at the appropriate times to measure repeat lengths. 

Depending on the preferences of the user, the tagged primers or tagged sample 
polynucleotides can be contacted with an addressable array of immobilized, different-sequence 
tag complements which each contain a sequence that is complementary to one of the tag 
20 segments, under conditions effective to hybridize the tag segments to corresponding tag 
complements on the support. By way of illustration, embodiments using either tagged primers 
or tagged sample polynucleotides are shown in Figs. 4 and 5, respectively, after the tag 
segments have been hybridized to a solid phase support, and a target-primer hybrid has been 
formed to yield tertiary polynucleotide complexes. 

25 

Fig. 4 shows a tertiary complex wherein a tag complement oligonucleotide 410 

immobilized on a solid phase support 400 is specifically hybridized to a tagged primer 420. 

Tagged primer 420 contains a target binding segment 422 and a tag segment 424. The vertical 

lines between the tag complement oligonucleotide 410 and tag segment 424 indicate 

30 complementary base-pairing. Also shown is a sample polynucleotide 440 comprising a 

sequence 442 which hybridizes to target binding segment 422, and which, in the embodiment 

shown, terminates at a nucleotide immediately adjacent to a sample repeat region 444. Sample 

polynucleotide 440 optionally includes flanking sequences 446 and 448 which do not hybridize 

to the tagged primer. Once polynucleotide 440 and tagged primer 420 have formed a target- 
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primer hybrid, the hybrid can be treated as discussed in preceding sections to extend the 
extendable end of the primer into the repeat region of the sample polynucleotide. Extension 
cycles are repeated until the end of the repeat region has been reached, or the desired number 
of repeats have been counted. 

5 

Fig. 5 shows an alternative tertiary complex wherein a tag complement oligonucleotide 
510 immobilized on a solid phase support 500 is specifically hybridized to a tagged sample 
polynucleotide 520. Polynucleotide 520 includes a tag segment 524 which is hybridized to 
oligonucleotide 510, and which is connected to a sample sequence 522. Sequence 522 includes 

10 a repeat region 528 which is flanked on either side by first and second sample segments 526 
and 530. An extendable primer 540 is annealed to one of sample segments 526 and 530 to form 
a target-primer hybrid such that the extendable end is adjacent to (or protrudes into) the sample 
repeat region 528. The hybrid can be treated as above to extend the extendable end of the 
primer into the repeat region of the sample polynucleotide, until the end of the repeat region 

15 has been reached. 

Tagged sample polynucleotides can be formed by any suitable method. Conveniently, 
tagged samples can be formed by PCR amplification using primer pairs comprising first and 
second target-specific primers which flank each sample sequence of interest, wherein one of 

20 the probes contains an identifier tag segment. In one embodiment, the tagged primer is 
constructed so that the tag segment is not copied during amplification, either due to the 
presence of non-standard internucleoside linkages (e.g., PNA linkages), or due to the presence 
of a non-polynucleotide linker region which separates the tag segment from the target-binding 
segment, as can be prepared by standard synthetic methods. After the tag segment is hybridized 

25 to a tag complement on an addressable array, the non-tagged strand can be removed from the 
tagged strand, e.g., by elevated temperature and/or reducing salt concentration to destabilize 
DNA-DNA duplex structure, particularly if either the tag melts from the tag complement at a 
higher temperature than the melting temperature between the target-binding segment of the 
primer and the sample polynucleotide. Many other approaches for obtaining a selected strand 

30 from duplex nucleic acids are known in the art, and include for example, (1) the use of a 

biotinylated primer in PCR to enable capture and separation of the non-tagged strand from the 

tagged strand under denaturing conditions, (2) the use of a PCR primer (non-tagged) that 

contains a short RNA segment which can be degraded with RNAse after PCR is complete, 

followed by destruction of the second RNAse degraded strand with an enzyme that has an 
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exonuclease activity (e.g., an appropriate DNAse), or an asymmetric PCR method which 
favors amplification of the tagged primer strand. 

Although Figs. 4 and 5 show particular embodiments, it will be apparent that other 
5 variations can be used in accordance with the methods of the invention. For example, with 
reference to Fig. 5, extendable primer 540 can be designed to bind instead to sample segment 
526, between the tag segment and the repeat region, such that the extendable end of the primer 
is again adjacent to sample repeat region 528. 

10 The tagged primers or tagged sample polynucleotides can be hybridized to a tag 

complement array at any appropriate time during the primer extension process. For example, 
the tagged primers of sample polynucleotides can be immobilized on the support prior to the 
first contacting step in which the different-sequence primers and sample polynucleotides are 
annealed to form target-primer hybrids. Alternatively, immobilization can be performed after 

15 the first primer extension reaction, or after the second primer extension reaction. In the latter 
case, a small aliquot of the extension reaction mixture can be removed after each extension 
cycle and hybridized to replicate arrays, such that each replicate array provides counting 
information after each cycle. 

20 IV. KITS FOR PRACTICING THE METHOD 

The present invention includes kits for carrying out the various aspects and 
embodiments of the methods of the invention. In a first aspect, kits of the invention include a 
primer, a first primer extension reagent, and a second primer extension reagent, wherein at 
least one of the first or second primer extension reagents includes an extendible nucleotide 

25 having a label attached thereto. Preferably, the label attached to the extendible nucleotide may 
be rendered undetectable following a treatment step effective to cleave the label from a primer 
extension product and/or destroy the signal producing properties of the label. Preferably, the 
primer is bound to a solid support, or, is capable of being bound to a solid support through a 
specific binding pair or through a covalent linking moiety. In another preferred embodiment, 

30 this aspect of the invention includes a solid-phase support for attachment of a target-primer 
hybrid through the primer or the target nucleic acid. Optionally, the kits, of this first aspect of 
the invention include a primer termination reagent. 
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In a second aspect, the kits of the invention include a primer, a first primer extension 
reagent, a second primer extension reagent, and a primer termination reagent, wherein the 
primer termination reagent includes a nucleotide terminator having a label attached thereto. 
The second primer extension reagent and the primer termination reagent may be packaged 
5 either separately or together as a mixture. Preferably, the primer is bound to a solid support, 
or, is capable of being bound to a solid support through a specific binding pair or through a 
covalent linking moiety. In another preferred embodiment, this aspect of the invention includes 
a solid-phase support for attachment of a target-primer hybrid through the primer or the target 
nucleic acid. 

10 

In another embodiment, the kits can include (A) a plurality of different-sequence 
primers, each containing (i) a target binding segment and (ii) a tag segment having a nucleotide 
sequence that uniquely identifies the target binding segment; a first primer extension reagent; 
and a second primer extension reagent, wherein at least one of the first or second primer 
15 extension reagents includes an extendible nucleotide having a label attached thereto; and/or (B) 
an addressable array of immobilized, different tag complements, wherein each different tag 
complement contains a sequence that is complementary to one of the primer tag segments, 
under conditions effective to hybridize the tag segments to corresponding tag complements on 
the support. 

20 

The invention may be further understood in light of the following examples, which are not 
intended to limit the invention. 

Example 

25 The following study illustrates exemplary sample hybridization conditions and 

extension cycles for counting repeats in four different sample sequences using an array of 
target-specific capture oligonucleotides immobilized on a solid support. 

A. Support . Glass microscope slides were immersed in 1 N HNO3 for 1 to 2 hours, 
30 followed by rinsing with deionized water. Optionally, slides were soaked overnight in 5% HC1 
to improve long-term stability of subsequent functionalizization. Slides were then sonicated 
sequentially in the following three solvents for 10 minutes each: hexane, acetone, and ethanol, 
followed by drying in air. 
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A solution of 2% (v/v) aminopropyltriethoxy silane (0.8 mL in 40 mL) in 95% 
acetone:5% water was prepared in a disposable plastic Falcon tube. This amount of solution 
was usually sufficient to coat at least three slides. The solution was allowed to stand for 5 to 
20 minutes to hydrolyze the ethoxy groups to hydroxyl groups. Air-dried slides were dipped in 
5 the solution for 2 minutes each, and then rinsed by dipping in three or more successive acetone 
baths. 

The slides were cured in 100°C oven for 45 minutes. Cured slides were treated for 2 
hours with a 0.2% solution of 1 ,4-phenylene diisothiocyanate (PDITC) in 10% 

10 pyridine:dimethyl formamide, followed by washes in baths of methanol and acetone, and air- 
drying. The slides may be stored under vacuum with a dessicant at 4°C. Stacking the slides 
also helps preserve the functionalization and keeps the functinalized surfaces free from 
particulates. (In an alternative embodiment, instead of 1 ,4-phenylene diisothiocyanate, the 
cured slides can be treated with a 1 mM solution of EMCS (N-(e-maleimido caproxyl) 

15 succinimide in methanohdimethylsulfoxide (80:20) for 2 hours.) 

B. Immobilization of Capture Oligos . Synthetic capture oligonucleotides were 
prepared by standard phosphoramidite synthetic methods. The capture oligonucleotides had the 
following general structure: 5'-amino-[(PEO) 2 ]-[capture oligo (24 nt)], where PEO = -O- 
20 (CH 2 CH 2 0) 6 added via DMT-0-(CH 2 CH 2 0) 6 -phosphoramidite. The capture oligonucleotides 
had polynucleotide sequences complementary to the double-underlined sequences in the four 
sample sequences shown in section C below. 

The capture oligos were spotted in a rectangular array pattern onto slides prepared as 
25 above using an ASYMTEC robotic loader (Asymtec, Carlsbad, CA, Model No. C-708), with 
oligonucleotide concentrations of about 50 pM (in 100 mM Na 2 C0 3 , pH 9.0) and spotting 
volumes of about 0.2 to 1 nL. The pattern included a separate set of 8 spots (2x4 rows) for 
each different capture oligo to provide redundancy. The spots had diameters of about 200 
micrometers and were spaced apart by 320 micrometers center-to-center. The robotic loader 
30 included a slide holder to hold the slides during oligonucleotide deposition, and a blotting area 
for cleaning dispenser tips. 
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After spotting, the capture oligos were fixed onto the slides by incubating the slides in a 
humidity chamber at room temperature for 60 minutes, followed by soaking in 1% NH 4 OH 
(aq) for 20 minutes and in 0.2% casein in TBS (50 mM Tris, 150 mM NaCL pH 8) for 20 
minutes. The slides were washed in deionized water and stored at 4°C. 

5 

C. Sample Hybridization and Analysis . Four different sample oligos were prepared to 
represent four different microsatellite alleles. In brief, the sample polynucleotides included the 
following sequences, where single-underlining indicates repeat regions, and double underlining 
indicates segments for hybridizing to complementary capture oligos on the support: 

10 

Sample Oligo 2260-2G (SEP ID NO:!) 

5 '-GTC AGG AC AC AAAGTG ATTTG ATGT AG ATTTTG A -3 1 

Sample Oligo 1411-lGfSEO IDNO:2) 
15 5 * -GTCAGGACT GATAAAGTGTAAAAG TGTATGAT-3 ' 

Sample Oligo 3149-2T (SEP ID NO:3) 

5 ' -GTCATT ACACTGTATGATAAAGGATTTTGATTGA -3 ' 

20 Sample Oligo 4223-4T (SEP ID NO:4) 

5 ' -GTCATT ACACACACGTATTGATTTGATTGATTGAGATT -3 1 

A mixture of the four sample oligos (0.33 jiM each in IX PCR buffer containing L5 
mM MgCl 2 , 50 mM KC1, 1.5 mM MgCl 2 , 10 mM Tris-HCl, pH 8.3, and 0.001% (w/v) gelatin) 
25 was placed over the capture oligos on the array and allowed to incubate for 1 hour at room 
temperature. After the incubation, unbound oligos were removed with IX TE (10 mM Tris, 
pH 8, 1 mM EDTA) containing 50 mM NaCL Replicate slides were subjected to different 
numbers of extension cycles (one, two, three or four cycles). Each extension cycle (also called 
a "reagent cycle") involved the following steps: 

30 

1) Incubate slide for 4 minutes at 37°C in an extension mixture containing: 
500 ^M dGTP 

20 nM Big Dye R6G ddATP (Perkin-Elmer, Foster City, CA, Emax = 560 nm) 
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20 nM Big Dye dRox ddCTP (Perkin-Elmer, Foster City, CA, Emax = 615 run) 
1 U/nL AMPLITAQ FS (plus 0.125 U/jiL pyrophosphatase) 
IX buffer (80 mM Tris, pH 9.0, 2.5 mM MgCl 2 ) 

5 2) Rinse slide with IX TE containing 50 mM NaCL then wash 3 times by immersion in same 
solution. 

3) Incubate slide for 4 minutes at 37°C in an extension mixture containing: 
500 \iM dATP 

10 20 jiM Big Dye R6G ddATP (Perkin-Elmer, Foster City, CA, Emax 

20 \iM Big Dye dRox ddCTP (Perkin-Elmer, Foster City, CA, Emax 
1 U/^L AMPLITAQ FS (plus 0. 125 U/jiL pyrophosphatase) 
IX buffer (80 mM Tris, pH 9.0, 2.5 mM MgCl 2 ) 

15 4) Rinse slide with IX TE containing 50 mM NaCl, then wash 3 times by immersion in same 
solution. 

After 4 extension cycles had been completed (each cycle included steps I through 4), each 
slide was overlaid with a viewing buffer containing 50% (w:v) urea and IX TBE (0.09 M 
20 Tris-borate, 2 mM EDTA, pH - 8.3) to provide an alkaline pH environment to enhance the 
fluorescence emissions of the dye labels. 

The slides were each viewed using an imaging device set for fluorescence emission 
detection at 560 nm and 610 nm. The device was an imaging fluorimeter that produces a two- 

25 dimensional image array of emission intensities (electronic image). Each point in the image 
array corresponds to a physical location on the sample slide. The excitation source is a 40 mW 
argon ion laser. The excitation wavelength can be selected from either of two natural laser 
lines (488 nm and/or 514 nm). The selection of laser wavelengths is accomplished by passing 
the beam which contains light of both wavelengths through one of three filters. The selected 

30 filter will pass either 488 nm only, 514 nm only, or both 488 and 514 nm. The filters are 
mounted on a moter driven platform so that the selection can be performed under computer 
control. The beam containing either or both wavelengths passes through a classical beam 
expander. The collimated expanded beam is then directed into a commercially available 



= 560 nm) 
= 615 nm) 
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assembly (Scanlab, Puchhein, Germany, Model SKI 020) containing two orthogonal "galvo 
mirrors". This is a mechanical device designed to rotate each of the two mirrors quickly and 
precisely over a small angle. The axes of rotation are orthoganal and independent so that the 
beam can be rastered over a rectangular pattern, also under computer control. 

5 

The scanned beam is focused by an f-theta lens which forms a scanned laser spot at one 
focal length. The spot size is 20 urn diameter. Typically, it is stepped over an area of 20 mm x 
20 mm in increments of 20 ^m, thereby forming a 1000 x 1000 array of pixels. The focused 
laser spot is formed after passing through a dichroic beam splitter which is designed to 
1 0 reflected the laser excitation wavelengths but transmit the fluorophore emission. 

As the laser beam steps across a fluorescent region, the fraction of fluorescence 
radiation within the solid angle of collection of the emission optics is transmitted by the 
dichroic mirror and is collimated by a lens having a 150 mm focal length. The collimated 

15 beam is then filtered by a long pass interference filter which further rejects laser light. A 
second lens of 75 mm focal length focuses the beam onto the photocathode of a red sensitive 
photomultiplier tube (PMT). The PMT photocathode does not need to be carefully adjusted 
since the focus is not critical. A filter wheel in front of the PMT allows only a small 
wavelength band (10 nm) to reach the detector. The filter wheel can be adjusted to one of six 

20 positions to allow for emission of multiple fluorophores to be discriminated. 

The electrical output of the PMT is proportional to the intensity of the light reaching 
the photocathode. The computer system is capable of storing the signals at each of an array of 
mirror positions, to form the electronic image. 

25 

The results were as expected. Specifically, for oligo 1 (SEQ ID NO:l), no signal was 
observed until completion of the second cycle, after which two GT dimers and a fluorescence- 
labeled ddC terminator had been appended to the immobilized capture oligo, which also served 
as the extendable primer. Similarly, for oligos 2 through 4 (SEQ ID NO:2 through 4), the 
30 appropriate fluorescent signal was observed after 1, 2 and 4 extension cycles, respectively. 

All publications and patent applications refered to in this disclosure are herein 

incorporated by reference to the same extent as if each individual publication or patent 

application was specifically and individually indicated to be incorporated by reference. 
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Although only a few embodiments have been described in detail above, those having 
ordinary skill in the molecular biology art will clearly understand that many modifications are 
possible in the preferred embodiments without departing from the teachings thereof. 
5 Accordingly, all such modifications are intended to be encompassed within the scope of the 
following claims. 
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WE CLAIM: 

1 . A method for determining the number of repeat units in a repeat region of a target 
5 nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for said primers to anneal to primer-complementary regions in one 
or more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 

10 having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on said hybrid(s) using a first primer 
extension reagent; 

1 5 (C) separating the target-primer hybrid(s) and unreacted first primer extension reagent; 

(D) performing a second primer extension reaction on said hybrid(s) using a second 
primer extension reagent, wherein at least one of the first or second primer extension reagents 
includes an extendible nucleotide having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
20 reagent; 

(F) measuring a signal produced by the label; 

(G) treating the label so as to render the label undetectable; and 

(H) repeating a cycle of steps (A) through (G) until the signal detected in the target- 
primer hybrid(s) is substantially less than a signal detected in a previous cycle, 

25 wherein (I) prior to step (F), at least an aliquot of either (1) said different-sequence 

primers or (2) said tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of said tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

30 

2. The method of claim 1 wherein step (D) further includes reacting the target-primer 
hybrid(s) with a primer termination reagent. 
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3. The method of claim 1 wherein each different-sequence primer contains (i) a target 
binding segment and (ii) a tag segment having a nucleotide sequence that uniquely identifies 
the target binding segment. 

4. The method of claim 3 wherein the contacting in step (I) is performed prior to step 

(A). 

5. The method of claim 3 wherein the contacting in step (I) is performed after step (A). 

6. The method of claim 3 wherein said method is performed on at least two replicate 
arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 
through (G) than is a second replicate array. 

7. The method of claim 1 wherein one or more polynucleotides in the sample are tagged 
polynucleotides that contain a tag segment having a nucleotide sequence that uniquely 
identifies the attached polynucleotide. 

8. The method of claim 7 wherein the contacting in step (I) is performed prior to step 

(A). 

9. The method of claim 7 wherein the contacting in step (I) is performed after step (A). 

10. The method of claim 7 wherein said method is performed on at least two replicate 
anrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 
through (G) than is a second replicate array. 

11. The method of claim 1 wherein the label is selected from the group consisting of 
fluorescent and chemiluminescent molecules. 

12. The method of claim 1 wherein the label is attached to the extendible nucleotide 
through a cleavable linker. 

13. The method of claim 1 wherein the target nucleic acid is amplified prior to 
analysis. 
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14. The method of claim 13 wherein amplification is achieved using a PCR. 

15. The method of claim 1 wherein the step of treating the label so as to render the label 
5 undetectable includes cleaving the label from the labeled extendible nucleotide. 

16. The method of claim 1 wherein the step of treating the label so as to render the label 
undetectable includes destroying a signal-producing property of the label. 

10 1 7. A method for determining the number of repeat units in a repeat region of a target 

nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for said primers to anneal to primer-complementary regions in one 
or more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 

15 each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on said hybrid(s) using a first primer- 
20 extension reagent; 

(C) separating the target-primer hybrid(s) from unreacted first primer extension 
reagent; 

(D) performing a second primer extension reaction on said hybrid(s) using a second 
primer extension reagent and with a primer termination reagent, the primer termination reagent 

25 including a nucleotide terminator having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
reagent and unreacted primer termination reagent; 

(F) measuring a signal produced by the label; and 

(G) repeating a cycle of steps (A) through (F) until a signal is detected indicating 
30 incorporation of the nucleotide terminator, 

wherein (H) prior to step (F), at least an aliquot of either (1) said different-sequence 
primers or (2) said tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
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sequence that is complementary to one of said tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

1 8. The method of claim 1 7 wherein each different-sequence pnmer contains (i) a 
5 target binding segment and (ii) a tag segment having a nucleotide sequence that uniquely 

identifies the target binding segment. 

19. The method of claim 18 wherein the contacting in step (H) is performed prior to 
step (A). 

10 

20. The method of claim 1 8 wherein the contacting in step (H) is performed after step 

(A). 

21. The method of claim 18 wherein said method is performed on at least two replicate 
15 arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 

through (F) than is a second replicate array. 

22. The method of claim 1 7 wherein one or more polynucleotides in the sample are 
tagged polynucleotides that contain a tag segment having a nucleotide sequence that uniquely 

20 identifies the attached polynucleotide. 

23. The method of claim 22 wherein the contacting in step (H) is performed prior to 
step (A). 

25 24. The method of claim 22 wherein the contacting in step (H) is performed after step 

(A). 



25. The method of claim 22 wherein said method is performed on at least two replicate 
arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 

30 through (F) than is a second replicate array. 

26. The method of claim 1 7 wherein the label is selected from the group consisting of 
fluorescent and chemiluminescent molecules. 
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27. The method of claim 1 7 wherein the target nucleic acid is amplified prior to 
analysis. 



28. The method of claim 27 wherein amplification is achieved using a PCR. 

29. A kit useful for determining the number of repeat units in a repeat region of a target 
nucleic acid comprising: 

a plurality of different-sequence primers, each containing (i) a target binding segment 
and (ii) a tag segment having a nucleotide sequence that uniquely identifies the target binding 
segment, 

a first primer extension reagent; and 

a second primer extension reagent, wherein at least one of the first or second primer 
extension reagents includes an extendible nucleotide having a label attached thereto. 

30. The kit of claim 29, which further includes an addressable array of immobilized, 
different tag complements, wherein each different tag complement contains a sequence that is 
complementary to one of said primer tag segments, under conditions effective to hybridize the 
tag segments to corresponding tag complements on the support. 

31. The kit of claim 29 wherein the label is selected from the group consisting of 
fluorescent and chemiluminescent molecules. 

32. The kit of claim 29 wherein the label is attached to the extendible nucleotide 
through a cleavable linker. 
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LENGTH DETERMINATION OF NUCLEIC ACID REP EAT SEQUENCES BY 
DISCONTINUOUS PRIMER EXTENSION 



FIELD OF THE INVENTION 

This invention relates to methods and kits useful for determining the length of nucleic 
acid repeat sequences. More specifically, this invention relates to methods and kits useful for 
determining the length of nucleic acid repeat sequences by employing a discontinuous primer 
extension reaction. 
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BACKGROUND 

Methods for the analysis of genetic polymorphism have found wide utility in basic 
research, clinical diagnostics, forensics, and other areas. One particularly useful method of 
detecting genetic polymorphism is based on variations in the length of repeat sequences, such 
1 5 methods being variously referred to as short tandem repeat analysis (STR), variable number of 
tandem repeat analysis (VNTR), minisatellite analysis, and microsatellite analysis. 

Detection of length polymorphisms in nucleic acid repeat sequences has up to now 
relied on gel electrophoresis for the determination of the length of the repeat sequence. 

20 However, gel electrophoresis has several important drawbacks in the context of repeat 
sequence length polymoiphism analysis. First, molecular length measurements based on 
electrophoretic mobility are inherently imprecise due to a complicated relationship between 
molecular size and electrophoretic mobility. Second, the degree to which the electrophoretic 
process can be multiplexed is limited by the number of electrophoresis lanes and by the size of 

25 different loci run in a single lane, i.e., loci must be selected which do not electrophoretically 
co-migrate. 

SUMMARY 

The method of the present invention comprises a discontinuous primer extension 
30 reaction wherein a primer is extended in discrete increments such that in each increment of 
primer extension the primer is extended by an amount corresponding to a single repeat unit. 
Following each increment of discrete primer extension, a detection step is performed in which 
a modulation in a signal is detected when the primer has been extended by an amount equal to 
the total length of a repeat region. Thus, by counting the number of increments of discrete 
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primer extension required to cause a modulation in the signal, the number of repeat units 
making up the repeat region is determined. 

It is an object of the present invention to provide a precise and reproducible method for 
5 determining the number of repeat units making up a repeat region of a nucleic acid repeat 
sequence. 

It is another object of the present invention to provide a method for determining the 
number of repeat units making up a repeat region of a nucleic acid repeat sequence which can 
10 perform a large number of measurements in parallel 

It is yet an additional object of the present invention to provide a method for 
determining the number of repeat units making up a repeat region of a nucleic acid repeat 
sequence which does not require an electrophoretic separation. 

15 

It is an object of the present invention to provide kits and reagents useful for practicing 
a method for determining the number of repeat units making up a repeat region of a nucleic 
acid repeat sequence having the above described characteristics. 

20 In a first aspect, the foregoing and other objects of the invention are achieved by a 

method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising annealing a primer-complementary portion of a target nucleic acid to a primer 
thereby forming a target-primer hybrid; performing a first primer extension reaction using a 
first primer extension reagent; separating the target-primer hybrid and unreacted first primer 

25 extension reagent; performing a second primer extension reaction using a second primer 
extension reagent, wherein at least one of the first or second primer extension reagents includes 
an extendible nucleotide having a label attached thereto; separating the target-primer hybrid 
from unreacted second primer extension reagent; measuring a signal produced by the label; 
treating the label so as to render the label undetectable; and repeating the above steps until the 

30 signal is substantially less than a signal detected in a previous cycle. 

In one preferred embodiment of the first aspect of the invention, the step of performing 
a second primer extension reaction further includes reacting the target-primer hybrid with a 
primer termination reagent. 

4 
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In yet another preferred embodiment of the first aspect of the invention, the label is a 
fluorescent or chemiluminescent molecule. 

5 In another preferred embodiment of the first aspect of the invention, the label is 

attached to the extendible nucleotide through a cleavable linker. 

In an additional preferred embodiment of the first aspect of the invention, the target 
nucleic acid is amplified prior to analysis. Preferably such amplification is achieved using a 
10 PCR. 

In another preferred embodiment of the first aspect of the invention, the step of treating 
the label so as to render the label undetectable includes either cleaving the label from the 
labeled extendible nucleotide or destroying a signal producing property of the label. 

15 

In another preferred embodiment of the first aspect of the invention, the target-primer 
hybrid is attached to a solid support. Preferably, one of the primer or the target nucleic acid is 
attached to the solid support. 

20 In a novel, preferred embodiment employing a solid support, the invention includes a 

method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for the primers to anneal to primer-complementary regions in one or 

25 more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

30 (B) performing a first primer extension reaction on the hybrid(s) using a first primer 

extension reagent; 

(C) separating the target-primer hybrid(s) and unreacted first primer extension reagent; 
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(D) performing a second primer extension reaction on the hybrid(s) using a second 
primer extension reagent, wherein at least one of the first or second primer extension reagents 
includes an extendible nucleotide having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 

5 reagent; 

(F) measuring a signal produced by the label; 

(G) treating the label so as to render the label undetectable; and 

(H) repeating a cycle of steps (A) through (G) until the signal detected in the target- 
primer hybrid(s) is substantially less than a signal detected in a previous cycle, 

10 wherein (I) prior to step (F), at least an aliquot of either (1) the different-sequence 

primers or (2) the tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of the tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

15 

In one embodiment, the contacting in step (I) is performed prior to step (A). In another 
embodiment, the contacting in step (I) is performed after step (A), and/or before any one of 
steps (B), (C), (D), (E), and (F). In yet another embodiment, steps (A) through (H) are 
performed on at least two replicate arrays, and one of the replicate arrays is subjected to at least 
20 one more cycle of steps (A) through (G) than is a second replicate array. 

In a second aspect, the foregoing and other objects of the invention are achieved by a 
method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising annealing a primer-complementary portion of a target nucleic acid to a primer 

25 thereby forming a target-primer hybrid; performing a first primer extension reaction using a 
first primer-extension reagent; separating the target-primer hybrid from unreacted first primer 
extension reagent; performing a second primer extension reaction using a second primer 
extension reagent and with a primer termination reagent, the primer termination reagent 
including a nucleotide terminator having a label attached thereto; separating the target-primer 

30 hybrid from unreacted second primer extension reagent and unreacted primer termination 
reagent; measuring a signal produced by the label; and repeating the above steps until a signal 
is detected indicating incorporation of the labeled nucleotide terminator into the primer 
extension product. 
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In yet another preferred embodiment of the second aspect of the invention, the label is 
selected from the group consisting of fluorescent and chemiluminescent molecules. 

In an additional preferred embodiment of the second aspect of the invention, the target 
nucleic acid is amplified prior to analysis. Preferably such amplification is achieved using a 
PCR. 

In a preferred embodiment of the second aspect of the invention, the target-primer 
hybrid is attached to a solid support. Preferably, one of the primer or the target nucleic acid is 
attached to the solid support. 

In a novel, preferred embodiment,the invention includes a method for determining the 
number of repeat units in a repeat region of a target nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for the primers to anneal to primer-complementary regions in one or 
more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on the hybrid(s) using a first primer- 
extension reagent; 

(C) separating the target-primer hybrid(s) from unreacted first primer extension 
reagent; 

(D) performing a second primer extension reaction on the hybrid(s) using a second 
primer extension reagent and with a primer termination reagent, the primer termination reagent 
including a nucleotide terminator having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
reagent and unreacted primer termination reagent; 

(F) measuring a signal produced by the label; and 

(G) repeating a cycle of steps (A) through (F) until a signal is detected indicating 
incorporation of the nucleotide terminator, 

wherein (H) prior to step (F), at least an aliquot of either (1) the different-sequence 

primers or (2) the tagged sample polynucleotides, are contacted with an addressable array of 
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immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of the tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

5 In one embodiment, the contacting in step (H) is performed prior to step (A). In another 

embodiment, the contacting in step (H) is performed after step (A), and/or before any one of 
steps (B), (C), (D), (E), and (F). In yet another embodiment, steps (A) through (G) are 
performed on at least two replicate arrays, and one of the replicate arrays is subjected to at least 
one more cycle of steps (A) through (F) than is a second replicate array. 

10 

In a third aspect, the foregoing and other objects of the invention are achieved by a kit 
useful for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising a primer having a sequence complementary to a primer-complementary portion of 
a target nucleic acid; a first primer extension reagent; and a second primer extension reagent, 
15 wherein at least one of the first or second primer extension reagents includes an extendible 
nucleotide having a label attached thereto. 

In a preferred embodiment of the third aspect of the invention, the primer is attached to 
a solid support. 

20 

In an additional preferred embodiment of the second aspect of the invention, the label is 
selected from the group consisting of fluorescent and chemiluminescent molecules. 

In another preferred embodiment of the second aspect of the invention, the label is 
25 attached to the extendible nucleotide through a cleavable linker. 

These and other objects, features, and advantages of the present invention will become 
better understood with reference to the following description, drawings, and appended claims. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a schematic depiction of a target nucleic acid. 
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FIGS. 2A-C show a first aspect of the method of the invention wherein an extendible 
nucleotide is labeled and the label is rendered undetectable subsequent to each discrete 
increment of primer extension wherein (a), (b) and (c) refer to the following: (a) React with a 
first primer extension reagent including a labeled, extendable nucleotide T*. (b) [1] Remove 
5 unreacted first primer extension reagent. [2] React with a second pnmer extension reagent 
including an extendable nucleotide G. [3] Remove unreacted second primer extension reagent. 
[4] Measure a signal produced by T*. (c) Treat to render label undetectable. 

FIGS. 3A-B show a second aspect of the method of the invention wherein a nucleotide 
10 terminator is labeled wherein (a) and (b) refer to the following: (a) React with a first primer 
extension reagent including a labeled, extendable nucleotide T*. (b) [1] Remove unreacted 
first primer extension reagent. [2] React with a second primer extension reagent including an 
extendable nucleotide G. [3] React with a primer termination reagent including a labeled 
nucleotide terminator C*. [4] Remove unreacted second primer extension reagent and primer 
1 5 termination reagent. [5] Measure a signal produced by C*. 

FIGS 4 and 5 show exemplary schemes for practicing the invention using a solid phase 
support containing an array of tag complements for obtaining sequence repeat information for 
a plurality of different samples in parallel, using tagged primers or tagged sample 
20 polynucleotides. 

DETAILED DESCRIPTION OF THE PREFERRED E MBODIMENTS 

Reference will now be made in detail to the preferred embodiments of the invention, 
examples of which are illustrated in the accompanying drawings. While the invention will be 
25 described in conjunction with the preferred embodiments, it will be understood that they are 
not intended to limit the invention to those embodiments. On the contrary, the invention is 
intended to cover alternatives, modifications, and equivalents, which may be included within 
the invention as defined by the appended claims. 

30 I. DEFINITIONS 

Unless stated otherwise, the following terms and phrases as used herein are intended to 

have the following meanings: 



9 



SUBSTITUTE SHEET (RULE 26) 



WO 0002824 PCT/US99/28584 

"Nucleoside" refers to a compound consisting of a purine, deazapurine. or pyrimidme 
nucleoside base, e.g., adenine, guanine, cytosine. uracil, thymine, deazaadenine. 
deazaguanosine. and the like, linked to a pentose at the 1 ' position, including 2'-deoxy and 2'- 
hydroxyl forms (Stryer). 

The term "nucleotide" as used herein refers to a phosphate ester of a nucleoside, e.g., a 
triphosphate ester, wherein the most common site of esterification is the hydroxyl group 
attached at the C-5 position of the pentose. Many times in the present disclosure the term 
nucleoside will be intended to include both nucleosides and nucleotides. The terms nucleotide 
and nucleoside as used herein are intended to include synthetic analogs having modified 
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nucleoside base moieties, modified sugar moieties, and/or modified phosphate ester moieties, 
e.g., as described elsewhere (Scheit; Eckstein). 

"Polynucleotide" or "oligonucleotide" refer to linear polymers of nucleotide monomers, 

5 including single, double and triple stranded deoxyribonucleotides, ribonucleotides, a-anomeric 
forms thereof, and the like. Usually the nucleoside monomers are linked by phosphodiester 
linkages, where as used herein, the term "phosphodiester linkage" refers to phosphodiester 
bonds or bonds including phosphate analogs thereof wherein the phosphorous atom is in the 
+5 oxidation state and one or more of the oxygen atoms is replaced with a non-oxygen moiety. 

10 Exemplary phosphate analogs include phosphorothioate, phosphorodithioate, 
phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, 
phosphoramidate, boronophosphates, and the like, including associated counterions, e.g., H + , 
NH 4 + , Na + , and the like if such counterions are present. Alternatively, polynucleotides may 
comprise polymers of non-nucleotidic monomers, linked through phosphodiester linkages or 

15 other linkages, which are capable of forming sequence-specific hybrids with a target nucleic 
acid, e.g., peptide nucleic acid polymers (PNAs, e.g., see Knudsen, 1996). Polynucleotides 
typically range in size from a few monomelic units, e.g. 8-40, to several thousands of 
monomelic units. Whenever a polynucleotide is represented by a sequence of letters, such as 
"ATGCCTG," it will be understood that the nucleotides are in 5 f ->3' order from left to right 

20 and that "A" denotes deoxyadenosine, "C" denotes deoxycytidine, "G" denotes 
deoxyguanosine, and "T" denotes thymidine, unless otherwise noted. 

"Extendible nucleotide" means any nucleotide that when incorporated into a primer 
extension product during a primer extension reaction allows for the further extension of such 
25 primer extension product. Exemplary extendible nucleotides include 2'-deoxynucleotide 
triphosphates, e.g., 2 , -deoxyuridine-5 , -triphosphate, 2 ^-deoxyguanosine-S triphosphate, T- 
deoxy-7-deazadeoxyguanosine-5'-triphosphate, 2 , -deoxyadenosine-5 , -triphosphate, 2'- 
deoxythymidine-5 , -triphosphate, and 2'-deoxycytidine-5 triphosphate. Optionally, one or 
more of the extendible nucleotides includes a label. 

30 

"Nucleotide terminator" means any nucleotide that when incorporated into a primer 
extension product prevents the further extension of such primer extension product. One 
requirement of a nucleotide terminator is that when the nucleotide terminator includes a 
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ribofuranose sugar portion, the B'-position must not have a hydroxy group capable of being 
subsequently used by a polymerase to incorporate additional nucleotides. Alternatively, a 
ribofuranose analog could be used, such as arabinose. Exemplary nucleotide terminators 
include 2\3*-dideoxy-P-D-ribofuranosyl, P-D-arabinofuranosyl, 3'-deoxy-P-D- 
5 arabinofuranosyl. 3'-amino-2\3'-dideoxy-P-D-ribofuranosyl, and 2\3 , -dideoxy-3 , -fluoro-P- 
D-ribofuranosyl (Chidgeavadze, 1984, 1985). Nucleotide terminators also include reversible 
nucleotide terminators (Metzker), and 3'-deoxy substituents such as hydrogen, 3'-fluoro, 3'- 
amino, and 3'-azido, for example (Mikhailopulo et al., 1989; Krayevski et al. s 1984; 
Chidgeavadze, 1986). 

10 

"Polymerase" means an enzyme or other catalyst capable of catalyzing a reaction 
leading to a target-sequence dependent incorporation of a nucleotide onto a 3 '-end of a primer 
or primer extension product when such primer or primer extension product is annealed to a 
target nucleic acid. Exemplary polymerases include but are not limited to Pfu DNA 
15 polymerase, E.Coll Polymerase I, T-7 polymerase, reverse transcriptase, Taq DNA 
polymerase, and the like (Kornberg and Baker). 

"Label" means any moiety that, when attached to a nucleotide or polynucleotide of the 
invention, render such nucleotide or polynucleotide detectable using known detection means. 

20 Labels may be direct labels which themselves are detectable or indirect labels which are 
detectable in combination with other agents. Exemplary direct labels include but are not 
limited to fluorophores, chromophores, radioisotopes, spin-labels, chemiluminescent labels, 
and the like. Exemplary indirect labels include enzymes which catalyze a signal-producing 
event, and ligands such as an antigen or biotin which can bind specifically with high affinity to 

25 a detectable anti-ligand, such as a labeled antibody or avidin. 

"Primer extension reaction" means a reaction between a target-primer hybrid and a 
nucleotide which results in the addition of the nucleotide to an end of the primer, usually the 
3'-end, such that the added nucleotide is complementary to the corresponding nucleotide of the 
30 target nucleic acid. 
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"Primer-extension reagent" means a reagent including components necessary to effect a 
primer extension reaction. Primer extension reagents typically include (i) a polymerase 
enzyme; (ii) a buffer; and (iii) one or more extendible nucleotides. 

5 "Specific binding pair" refers to a pair of molecules that specifically bind to one 

another to form a binding complex. Examples of specific binding pairs include, but are not 
limited to antibody-antigen (or hapten) pairs, ligand-receptor pairs, enzyme-substrate pairs, 
biotin-avidin pairs, polynucleotides having complementary base pairs, and the like. 

10 "Primer" is a polynucleotide capable of selectively annealing to a specified target 

sequence and thereafter serve as a point of initiation of a primer extension reaction wherein the 
primer is extended in the 3'-» 5' a 5'-> 3' direction, typically the latter. 

IL MATERIALS USED IN THE METHOD OF THE INVENTION 

15 

A. Target Nucleic Acid 

The target nucleic acids for use with the invention may be derived from any living or 
once living organisms, including but not limited to prokaryotes, eukaryotes, plants, animals, 
and viruses, as well as synthetic nucleic acids. The target nucleic acids may originate from any 
20 of a wide variety of sample types, such as cell nuclei (e.g., genomic DNA) and extranuclear 
nucleic acids, e.g., plasmids, mitrochondrial nucleic acids, and the like. The target nucleic 
acids can include DNA or RNA, and are usually DNA. 

Many methods are available for the isolation and purification of a target nucleic acid 
25 for use in the present invention. The preferred purification method should provide target 
nucleic acid sufficiently free of protein to allow efficient primer extension and nucleic acid 
amplification. Preferred purification methods include (i) organic extraction followed by 
ethanol precipitation, e.g., using a phenol/chloroform organic reagent (Ausubel), preferably 
using an automated DNA extractor, e.g., the Model 341 DNA Extractor available from PE 
30 Applied Biosystems (Foster City, CA); (ii) solid phase adsorption methods (Walsh, Boom); 
and (iii) salt-induced DNA precipitation methods (Miller), such methods being typically 
referred to as "salting-out" methods. Optimally, each of the above purification methods is 
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preceded by an enzyme digestion step to help eliminate protein from the sample, e.g., digestion 
with proteinase K, or other like proteases. 

To increase sensitivity, preferably the target nucleic acid is amplified prior to 
performing the method using a suitable nucleic acid amplification procedure. Such 
amplification may be linear or exponential. In a preferred embodiment, amplification of the 
target nucleic acid is accomplished using the polymerase chain reaction (PCR) (Mullis). 
Generally, the PCR consists of an initial denaturation step which separates the strands of a 
double stranded nucleic acid sample, followed by the repetition of (i) an annealing step which 
allows amplification primers to anneal specifically to positions flanking a target sequence; (ii) 
an extension step which extends the primers in a 5'-> 3' direction thereby forming an 
amplicon nucleic acid complementary to the target sequence, and (iii) a denaturation step 
which causes the separation of the amplicon and the target sequence. Each of the above steps 
may be conducted at a different temperature, where the temperature changes may be 
accomplished using a thermocycler (PE Applied Biosystems, Foster City, CA). 

The generalized structure of a target nucleic acid for use in the present invention is 
shown in FIG. 1 where the target nucleic acid 5 includes a 5' -flanking portion 10 including a 
primer complementary portion 15, a 3'-flanking portion 25, and a repeat region 20 located 
between the S'-flanking portion and the and the 3'-flanking portion. The repeat region 20 of 
the target nucleic acid comprises multiple repeat units (R) n 21 where R indicates a repeat unit 
and n designates the number of repeat units making up the repeat region. The repeat unit R 
may be any type of repeat motif, for example, but not limited to a microsatellite repeat 
(Webber and May; Smeets; Williamson), a minisatellite repeat (Jeffreys, Caskey), or an <x- 
satellite repeat (Jabs). 

The repeat region may be made up of multiple types of repeat units or repeat units 
which are themselves polymorphic. 

B. Primer 

Primers for use in the present invention are designed to obtain a balance between 
specificity of primer annealing, i.e., the frequency with which an undesired target sequence 
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participates in a primer extension reaction, and efficiency of primer extension, i.e., the extent 
to which a desired target sequence participates in the primer extension reaction. 

Specificity of primer annealing is generally controlled by the length of the primer and 
5 the temperature of the annealing reaction. Polynucleotides between about 18 and 24 bases are 
preferred because such polynucleotides tend to be very sequence specific when the annealing 
temperature is set within a few degrees of a primer melting temperature (Dieffenbach). To 
facilitate primer extension, a 3'-end of the primer includes an — OH group or other moiety 
which allows incorporation of a nucleotide onto the 3*-end of the primer. There exist a number 
10 of computer programs to facilitate primer selection in different contexts (Osborne; Montpetit). 

In a preferred embodiment, the sequence of the primer is selected such that the primer 
anneals to the primer complementary portion of the 5 '-flanking portion of the target nucleic 
acid. Preferably, the primer anneals such that a 3'-end of the primer is adjacent to a 5'-end of a 
15 repeat region of a target nucleic acid. However, the primer may also anneal to a segment of 
the repeat region of the target nucleic acid so long as it is at least partially anchored to the 5'- 
flanking portion of the target. 

For embodiments of the invention which employ sample identifier tags and arrays of 
20 tag complements, the invention utilizes a plurality of extendable, different-sequence primers 
for detecting target sequences of interests. In one embodiment, the tagged primer includes a 
target binding segment, a tag segment, and an extendable primer end (5 1 or 3'). The target 
binding segment includes a polynucleotide sequence which is selected to bind to a selected 
target sequence. The tag segment contains a unique polynucleotide sequence that allows 
25 identification of the target binding segment to which the tag segment is attached. The tag 
segment can be directly attached to the distal end of the target binding segment, or is optionally 
linked to the tag segment by an intervening spacer group. In another embodiment, the tag 
segment is linked to an internal site within the target binding segment. Thus, the tag can be 
linked to an intersubunit linking group, or to a nucleotide base, within the target binding 
30 segment. Preferably, the tag is attached to an end of the target binding segment that is distal 
with respect to the extendable end of the primer. 

The sequence of each target binding segment is selected to hybridize to a selected 

complementary target which contains a potential polymorphism or mutation, preferably such 
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that the 3 , -end of the primer is adjacent to a 5'-end of a repeat region of a target nucleic acid 
(for extension in the 5* to 3' direction). However, the primer may also anneal to a segment of 
the repeat region of the target nucleic acid so long as it is at least partially anchored to the 5*- 
flanking portion of the target. 

5 The length of the target binding segment in each tagged primer is selected to ensure 

specific hybridization of the primer to the desired target, without significant cross- 
hybridization to non-target nucleic acids in the sample. Also, to enhance primer specificity, it 
is preferred that the melting temperatures of the target binding segments are within a few 
degrees of each other. Preferably, the melting temperatures of the target binding segments fall 

10 within a ATm range (Tmax - Tmin) of 10°C or less, and preferably 5°C or less. This can be 
accomplished by suitable choice of binding segment lengths based on known methods for 
predicting primer melting temperatures (Breslauer, 1986; Rychlik, 1989 and 1990; Wetmur, 
1991; Osborne. 1991; Montpetit, 1992) for example. As above, target binding segments 
between about 18 and 24 bases in length are preferred. 

15 

The tag segment in each tagged primer is designed to contain a sequence that uniquely 
identifies the attached target binding segment. Thus, the tag sequences should be selected to 
minimize (1) internal, self-hybridization, (2) hybridization with other same-sequence tags, (3) 
hybridization with other, different sequence tag complements. (4) and hybridization with the 
20 sample polynucleotides. Also, it is preferred that each tag can specifically recognize and 
hybridize to its corresponding tag complement under the same conditions for all tags in the 
primers. 

Tag sequences can be selected by any suitable method. For example, computer 
25 algorithms for selected non-crosshybridizing sets of tags are described in Brenner (1996) and 
Shoemaker (1997). Preferably, the tag sequences have strands that are within a preselected 
temperature range, as discussed above with respect to the extendable primers. Preferably, the 
melting temperatures of the target binding segments fall within a ATm range (Tmax - Tmin) of 
10°C or less, and preferably within 5°C or less, as calculated using any of the methods above 
30 (e.g., Breslauer). Preferably, the tag segments are at least 12 bases in length to facilitate 
specific hybridization to corresponding tag complements. Typically, tag segments are from 12 
to 60 bases in length, and typically from 15 to 30 bases in length. 
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Tags and tag complements may be single or double stranded, such that sequence 
specific hybridization forms either duplexes by Watson and Crick base-pairing, or triplexes by 
forward or reverse Hoogsteen bonding. In embodiments where specific hybridization occurs 
via triplex formation, coding of tag sequences follows the same principles as for duplex- 
forming tags; however, there are further constraints on the selection of word sequences. 
Generally, third strand association via Hoogsteen type of binding is most stable along 
homopyrimidine-homopurine tracks in a double stranded target. Usually, base triplets form in 
T-A*T or C-G*C motifs (where "-" indicates Watson-Crick pairing and "*" indicates 
Hoogsteen type of binding); however, other motifs are also possible. For example, Hoogsteen 
base pairing permits parallel and antiparallel orientations between the third strand (the 
Hoogsteen strand) and the purine-rich strand of the duplex to which the third strand binds, 
depending on conditions and the composition of the strands. 

There is extensive guidance in the literature for selecting appropriate sequences, 
orientation, conditions, nucleoside type (e.g. whether ribose or deoxyribose nucleosides are 
employed), base modifications (e.g. methylated cytosine, and the like in order to maximize, or 
otherwise regulate, triplex stability as desired in particular embodiments, e.g., Brenner (supra). 
More generally, conditions for annealing single-stranded or duplex tags to single-stranded or 
duplex sequence complements are well known, e.g. Brenner (supra), Ji et al. (1993), Cantor et 
al. (supra), Wetmur (1991), Breslauer et al. (1986), Schena (1995), and the like. 

Preferably, polynucleotides such as primers are synthesized conventionally on an 
automated DNA synthesizer, e.g., PE Applied Biosystems (Foster City, CA) model 392 or 394 
DNA/RNA Synthesizer, using standard chemistries, e.g., phosphoramidite chemistry 
(Beaucage). In an alternative method, primers can be isolated from a biological source. 

C. Solid Phase Supports 

In a preferred embodiment of the method of the invention, a target-primer hybrid is 
attached to a solid phase support during a separating step. Such attachment may be through 
either the target nucleic acid or the primer polynucleotide. 

Solid phase supports for use with the invention may have a wide variety of forms, 

including microparticles, beads, membranes, slides, plates, micromachined chips, and the like. 

In addition, solid phase supports of the invention may comprise a wide variety of 
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compositions, including glass, plastic, silicon, alkanethiolate-derivatized gold, GaAs, copper, 
germanium, cellulose, low cross-linked and high cross-linked polystyrene, crosslinked 
polyacrylamide matrices, silica gel, polyamide, membranes such as nylon, polyvinylidine 
difluoride (PVDF), or poly-tetrafluoroethylene, and the like. 

5 

Where attachment of the target-primer hybrid is through the primer, primers may be 
used with a solid phase support on which they were synthesized, or they may be separately 
synthesized and attached to a solid phase support for use during or before the separation step of 
the method. 

10 

When primers are synthesized on and used with the same solid phase support, such 
support may comprise a variety of forms and include a variety of linking moieties. Such 
supports may comprise microparticles or planar arrays, or matrices of regions having 
substantially uniform populations of primers. A wide variety of microparticle synthesis 

15 supports may be used with the invention, including microparticles made of controlled pore 
glass (CPG), highly cross-linked polystyrene, acrylic copolymers, cellulose, nylon, dextran, 
latex, polyacrolein, and the like. Microparticle supports further include commercially available 
nucleoside-derivatized CPG and polystyrene beads (e.g. available from Applied Biosystems, 
Foster City, CA); derivatized magnetic beads; polystyrene grafted with polyethylene glycol 

20 (e.g., TentaGel, Rapp Polymere, Tubingen Germany); and the like. Selection of the support 
characteristics, such as material, porosity, size, shape, and the like, and the type of linking 
moiety employed depends on the conditions under which the primers are used. For example, 
in the present invention, supports and linkers that minimize steric hindrance of the polymerase 
enzymes and that facilitate access to nucleotide substrate are preferred. Other important 

25 factors to be considered in selecting the most appropriate microparticle support include size 
uniformity, efficiency as a synthesis support, degree to which surface area known, and optical 
properties, e.g., clear smooth beads provide instrumentational advantages when handling large 
numbers of beads on a surface. 

30 As mentioned above, primers may also be synthesized on a single (or a few) solid phase 

supports to form an array of regions uniformly coated with primers. That is, within each 
region in such an array the same primer is synthesized. Techniques for synthesizing such 
arrays are disclosed elsewhere (Pease; Southern). 
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When primers are separately synthesized, and subsequently attached to a solid phase 
support for use, the primer may be attached to the support through a covalent linkage or a non- 
covalent linkage. When the primer is attached to the solid support through a non-covalent 
linkage, the primer includes one member of specific binding pair, e.g., biotin, the other 
member of the pair being attached to the solid support, e.g., avidin. Several methods are 
available for covalently linking polynucleotides to solid supports, e.g., through reaction of a 5'- 
amino polynucleotide with an isothiocyanate-functionalized glass support (Guo). A wide 
range of exemplary linking moieties for attaching primers onto solid supports either covalently 
or non-covalently are disclosed elsewhere. (Pon; Webb; Barany; Damha; Beattie; Maskos and 
Southern). 

Where attachment of the primer-template hybrid is through the template nucleic acid, 
and the template nucleic acid is a PCR amplicon, the means for covalent or non-covalent 
attachment may be incorporated into a PCR primer used to effect the PCR. Thus, the PCR 
primer may contain a member of a specific binding pair, e.g., biotin, or a reactive moiety 
which can react with a functionalized solid support to form a covalent linkage, e.g., a 5 '-amino 
group which reacts with a an isothiocyanate-functionalized glass support. 

As noted above, the invention also utilizes a set of tag complements which are 
complementary to corresponding tag sequences in the tagged primers. The tag complements 
are provided as an addressable array, according to the design choice of the user. By 
"addressable array" is meant that the sequence of the target binding segment of each primer is 
known or can be determined from the position of hybridization of each primer on the array. 
Preferably, the tag complements are immobilized in discrete regions on a planar surface, such 
that each discrete region contains only tag complements having a particular sequence, and such 
that the sequence of the tag complement at each different discrete region is known. 
Conveniently, the tag complements are distributed as a periodic two-dimensional array of 
discrete tag complement regions which can be indexed via X and Y coordinates, or any 
equivalent thereof. Tag complements can be attached to appropriate solid phase support 
materials following the same considerations as for attachment of primers, discussed above. 

To reduce the amounts of assay reagents used for tag detection, and to facilitate the 

sequencing of large numbers of fragment sequences, the arrays are preferably formed as 

microarrays having tag complement region densities of greater than 100 regions/cm 2 , 300 
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regions/cm 2 . 10 3 regions/cm 2 , 3 x 10 3 regions/cm 2 , 10 4 regions/cm 2 , 10" regions/cm 2 , or 10 6 
regions/cm 2 . In addition, the number of different sequence tag complements in each array is 
preferably equal to or greater than 10, 20, 50, 100, 200, 500. 1000, 3000, 10,000, 30,000, 
100,000, or 300,000. 

D. Labeled Nucleotides 

In the methods of the present invention, one or more extendible nucleotides and/or 
nucleotide terminators include a label. The label is attached to the nucleotide in such a way 
that the label does not substantially interfere with polymerase-mediated incorporation of the 
labeled nucleotide in a primer extension reaction. Many alternative methods are available for 
labeling nucleotides in a manner suitable for use with the present invention (Kricka). In a 
preferred class of labeling methods, a nucleoside base of the nucleotide is modified to include 
a label, i.e., the N-6 position of a purine base or the C-5 position of a pyrimidine base. A 
particularly preferred class of labeled nucleotides are the propargylethoxyamino nucleotides 
(Khan). 

In one preferred embodiment of the invention, a labeled extendible nucleotide is 
capable of being rendered undetectable, e.g., upon treatment with a suitable reagent or 
electromagnetic radiation. In this embodiment, the labeled extendible nucleotide may be 
rendered undetectable by either removing the label from the nucleotide or by destroying the 
signal-producing properties of the label. 

Several methods are available for attaching a label to an extendible nucleotide such that 
the label may be easily removed. Exemplary cleavable linkers for linking a label to a 
nucleotide include but are not limited to (N-[4-(p-azidosalicylamido)butyl]-3 , -[2'- 
pyridyldithio]propionamide (APDP), (bis[2-(succinimidooxycarbonyloxyl)ethyl]sulfone 
(BSOCOES), disuccininimdyl tartarate (DST), and ethylene glycobis-[succinimidylsuccinate] 
(EGS), and the like (Pierce Catalog), 

Preferred labels whose signal producing properties may be destroyed upon treatment 

with a suitable reagent or electromagnetic radiation include fluorescent labels whose 

fluorescent properties may be destroyed by photodestruction through exposure to high intensity 

light or by chemical degradation through treatment with oxidizing chemicals, e.g., oxygen, 

sodium hypochlorite, permanginate and the like. Alternative preferred classes of labels 
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include enzymes, which may be rendered undetectable by reaction with an irreversable 
enzyme inhibitor or by denaturation, and chemiluminescent labels which can undergo only a 
single light-producing transformation and are thus autodestructing. 

5 E. First and Second Primer-Extension Reagents 

The present invention includes first and second primer extension reagents that, when 
used according to methods of the invention, enable the extension of a primer to proceed in 
discrete increments of single repeat units, i.e., the primer is extended only by one repeat unit 
per discrete primer extension reaction cycle. 

10 

The first primer extension reagent of the invention includes a set of extendible 
nucleotides which allow a primer extension reaction to proceed only to the extent that a primer 
is extended by an amount less than a full repeat unit. Thus, depending on the particular 
sequence of the repeat unit, the first primer extension reagent may include a variety of possible 
15 extendible nucleotide combinations. For example, if the sequence of the repeat unit is AGCT, 
the first primer extension reagent could include extendible nucleotides T (complementary to 
A), T and C (complementary to A and G), or T and C and G (complementary to A and G and 
C). However, to prevent uncontrolled continuous primer extension, the first primer extension 
reagent should not contain extendible nucleotides T, C, G and A together. 

20 

In certain situations, it is desirable to sub-divide the first primer extension reagent into 
separate sub-reagents such that each sub-reagent includes extendible nucleotides sufficient to 
allow extension of a primer only to the extent that a sub-portion of the repeat sequence is 
formed. For example, if the repeat unit is ATGCCGT, one sub reagent of the first primer 
25 extension reagent could include extendible nucleotides T, A, and C, while another sub-reagent 
of the first primer extension reagent could include extendible nucleotides G and C. 

The second primer extension reagent of the invention includes a set of extendible 
nucleotides which allow a primer extension reaction to proceed only to the extent that the 
30 portion of a repeat unit not synthesized by the first primer extension reagent is synthesized. 
Thus, depending on the particular sequence of the repeat unit and the composition of the first 
primer reagent, the second primer extension reagent may include a variety of possible 
nucleotide combinations. Continuing the example discussed above, if the sequence of the 
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repeat unit is AGCT and the first primer extension reagent includes extendible nucleotides T 
and C, the second primer extension reagent may include extendible nucleotides G and A. 

F. Primer Termination Reagent 
5 The present invention includes a primer termination reagent for causing the termination 

of pnmer extension such that once a primer extension product has reacted with the primer 
termination reagent, no further primer extension may be accomplished. 

The primer termination reagent of the invention includes one or more nucleotide 
10 terminators, and optionally, a set of extendible nucleotides which allow a primer extension 
reaction to proceed only to the extent that a primer is extended by an amount less than a full 
repeat unit in a primer extension reaction. Thus, depending on the particular sequence of the 
repeat unit, the sequence of the 3'-flanking portion of the target nucleic acid, and the 
composition of the first and second primer extension reagents, the primer termination reagent 
15 may include a variety of possible extendible nucleotide and nucleotide terminator 
combinations. For example, if the sequence of the repeat unit is AGCT, and the first nucleotide 
of the 3'-flanking portion is G, and the first and second primer extension reagents include the 
extendible nucleotides T, C, G and A, the primer termination reagent would include only the 
nucleotide terminator C, such nucleotide terminator being complementary to the G located in 
20 the 3'-flanking portion. Alternatively, if the first and second primer extension reagents only 
include the extendible nucleotides T and C, the primer termination reagent would include 
extendible nucleotides G and A and nucleotide terminator C. 

In certain aspects of the present invention, the primer termination reagent includes a 
25 labeled nucleotide terminator. The labeling of the nucleotide terminator is accomplished 
essentially as described above in Section D. 

in. THE METHOD 

A preferred embodiment of a first aspect of the method of the invention is 
30 schematically depicted in FIGS. 2A-C. In the figure, the method is applied to a target nucleic 
acid 5 having a repeat region 20 made up of two copies of a two-base repeat having the 
sequence "AC" and a 3 '-flanking portion 25 having a G nucleotide abutting the repeat region. 
In this preferred embodiment of the first aspect, a primer 200 is annealed to a primer- 
complementary portion 15 of the target nucleic acid 5 thereby forming a target-primer hybrid. 
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The target-primer hybrid is then reacted with a first primer-extension reagent including a 
labeled extendible nucleotide T, resulting in the incorporation of the labeled T nucleotide (T*) 
into the 3'-end of a primer extension product 210. Following reaction with the first primer 
extension reagent, the first primer extension reagent is separated from the target-pnmer hybrid 
and the target-primer hybrid is reacted with a second primer-extension reagent including an 
extendible G nucleotide, resulting in the addition of the G nucleotide into a 3'-end of the 
primer extension product 215. Next the unreacted second primer extension reagent is 
separated from the target-primer hybrid and a measurement is performed to determine the 
amount of labeled extendible nucleotide incorporated into the primer extension product. As 
indicated by the histogram in the figure, at this point in the process, a large signal is detected, 
indicating the presence of the incorporated labeled T nucleotide. Finally, in order to prepare 
the target-primer hybrid for a subsequent discrete primer extension reaction cycle, the label 
attached to the incorporated extendible nucleotide is rendered undetectable. In the example the 
above described process steps are repeated two more times as shown in FIGS. 2 B and 2C. In 
the third cycle shown in FIG. 2C, the intensity of the measured signal is substantially reduced 
as compared to the signal intensities seen in the first two cycles because the labeled extendible 
nucleotide T can not be incorporated into the primer extension product at this point. Thus, this 
reduction in the measured signal seen in the third cycle indicates that the repeat region only 
contains two copies of the AC repeat unit. 

A preferred embodiment of a second aspect of the method of the invention is shown in 
FIGS. 3A-B. As before, the method is applied to a target nucleic acid having a repeat region 
made up of two copies of a two-base repeat having the sequence "AC" and a 3'-flanking 
portion having a G nucleotide abutting the repeat region. In this preferred embodiment of the 
second aspect, as before, a primer 200 is annealed to a primer-complementary portion 15 of a 
target nucleic acid 5 thereby forming a target-primer hybrid. The target-primer hybrid is then 
reacted with a first primer-extension reagent including an unlabeled extendible nucleotide T, 
resulting in the incorporation of the unlabeled T nucleotide into the 3'-end of a primer 
extension product 310. Following reaction with the first primer extension reagent, the first 
primer extension reagent is separated from the target-primer hybrid and the target-primer 
hybrid is reacted with a second primer-extension reagent, including an extendible G 
nucleotide, and a primer termination reagent including a labeled C nucleotide terminator (C*), 
resulting in the addition of only the G extendible nucleotide into the 3'-end of the primer 
extension product 315. Next the unreacted second primer extension reagent and primer 
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terminate reagent are separated from the target-pnmer hybnd and a measurement is 
performed to determine the amount of labeled nucleotide terminator incorporated .nto the 
primer extension product. As indicated in the figure, at this point in the process, no signal .. 
detected indicatine that the labeled nucleotide terminator is not incorporated into the pnmer 
extension product during this cycle of the process. In FIG. 3B, the above descnbed process 
step is repeated. In this second cycle shown in FIG. 3B. the intensity of the measured signal is 
substantially increased as compared to the nominally zero s.gnal intensity seen in the first step 
of the process because during the second step, the labeled nucieotide C terminator (C*) is 
incorporated into the primer extension product. 

The following discussion provides a more detailed description of the above-described 
method steps of the first and second aspects of the invention. 

a Primer Annealing 

The annealing reaction is performed under conditions which are stringent enough to 
guarantee sequence specificity yet sufficiently permissive to allow formation of stable hybrids 
at an acceptable rate. The temperature and length of time required for primer annealing depend 
upon several factors including the base composition, length and concentration of the pnmer, 
and the nature of the solvent used, e.g., the concentration of cosolvents such as DMSO, 
20 formamide, or glycerol, and counter ions such as magnesium. Typically, hybridization with 
synthetic polynucleotides is carried out at a temperature that is approximately 5 to 10°C below 
the melting temperature of the target-primer hybrid in the annealing solvent. Preferably, the 
annealing temperature is in the range of 55 to 7 5 o C and the pnmer concentration is 
approximately 0.2 uM. Under these preferred conditions, the annealing reaction will be 
25 complete in only a few seconds. 

R Primer Extension Reaction 

The time required to effect a primer extension reaction depends upon the length and 
concentration of the target sequence and upon the reaction temperature. Estimates for the rate 
30 of nucleotide addition under typical conditions vary from 35 to 100 nucleotides per second 
depending upon the buffer, P H, salt concentration, polymerase enzyme, and the nature of the 
DNA template. 
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to order to ach.eve a primer ex,ens,on reaction which proceeds in discrete increments 
of single repeat units, according to the method of the invention, the primer extension reaction 
is divided into two independent steps, a firs, primer extension reaction and a second pnmer 
extension reaction. In the firs, pnmer ex,ens,on reaction, .he pnmer ,s extend by an amount 
less than .he length of a single repea. unit, where control of .he ex.en, of pnmer ex«ens,on ,s 
effected by ,he composition of a first primer extension reagent In ,he second primer ex.ens,o„ 
.action, the primer is extended by an amount such that, ,„ combination with .he firs, pnmer 
extension reaction, the pnmer ,s extended by an amount equal ,o .he length of a single repea. 



unit. 



According to the first aspect of the method of the invention, one of the first or second 
primer extension reagents includes a labeled extendible nucleotide. 

Also according to the first aspect of the method of the invention, in a variant of the 
5 above-described two-step discrete pnmer extension reaction, the second pnmer extension 
reagent includes a pnmer terminate reagent. Thus, if after a second pnmer extension 
reaction the pnmer has been extended to the end of the repeat region of the target nuclerc acid, 
a nucleotide terminator will be incorporated into the pnmer extension product thus prohibiting 
any further extension of that pnmer extension product. This i, advantageous because ,t will 
20 remove the possibility that any spurious primer extension beyond the repeat region of the target 
nucleic acid will take place. This embodiment of the invention is particularly preferred where 
two alleles of a repeat sequence are being investigated simultaneously. 

According to the second aspect of the invention, a primer termination reagent including 
25 a labeled nucleotide terminator is included in the second pnmer extens.on reaction. Preferably, 
the labeled nucleotide terminator is selected to be complementary to the nucleotrde at the 3'- 
end of the Shanking portion of the target nucleic acid which abuts the repeat reg.on of such 



target nucleic acid. 



30 C Se paration of P rimer and Pnmrr Extension Reagents 

Between the first and second primer extension reactions, a separation step rs performed 
to prevent the mixing of first and second primer extension reagents and thereby prevent 
uncontrolled primer extension. The means used to effect the separation step may be any means 
capable of separating the target-primer hybrid from the first and/or second pnmer extension 
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reagents. Exemplary separation methods include but are not limited to HPLC, electrophoresis, 
liquid-liquid extraction, solid-liquid extraction, adsorption, and the like. 

In a preferred embodiment, the target-pnmer hybrid is attached to a solid support 
5 during the separation step such that the pnmer extension reagents may be separated from the 
target-pnmer hybnd simply by washing the solid support. Accordmg to this embodunent, the 
primer may be attached to the solid support before or after performmg the fust pnmer 
extension reaction. The washing conditions are such that the target-pnmer hybnd is not 
substantially disrupted and nonspecific adsorpt.on of the pnmer extension reagents is 
10 minimized. 



D. Measurine a Signal 

Subsequent to either the first or second pnmer extension reaction, a detection step >s 
performed wherein the amount of intact label which has been incorporated into a pnmer 
5 extension product is determined. Any detection method may be used which is suitable to the 
type of label employed. Thus, possible detection methods include radioactive detection, 
optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, 
e.g., fluorescence or chemiluminescence. 

20 The measunng step can take place at various points in the process depending upon the 

aspect of the invention being practiced and the composition of the first and second pnmer 
extension reagents. In the first aspect, preferably the measunng step takes p.ace after the 
primer extension reagent including the labeled extendible nucleotide has been reacted w,th and 
separated from the target-pnmer hybrid. In the second aspect, the measunng step takes place 

25 after the pnmer termination reagent including the labeled nucleotide terminator has been 
reacted with and separated from the target-primer hybrid 

If the target-primer hybrid(s) are immobilized on a solid support for ana.ysis, extended 
primers can be detected in an addressable array by scanning all or portions of each array 
30 simultaneously or senally, depending on the scanning method used. For fluorescence labehng, 
selected regions on an array may be senally scanned one-by-one or row-by-row using a 
fluorescence microscope apparatus, such as descnbed in Fodor (1995) and Mathies et al. 
(1992) Hybridization patterns may also be scanned using a CCD camera (e.g., Model 
TE/CCD512SF, Princeton Instruments, Trenton, NJ) with suitable optics (Ploem, 1993), such 
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as described in Yershov et al. (1996), or may be imaged by TV monitoring (Khrapko, 1991). 
For radioactive signals (e.g., 32 P), a phosphonmager device can be used (Johnston et al.. 1990; 
Drmanac et at.. 1992; 1993). Other commercial suppliers of imaging instruments include 
General Scannins Inc., (Watertown, MA, wxvw.genscan.com). Genix Technologies (Waterloo, 
Ontario, Canada; www.confocal.com), and Applied Precision Inc. Such detection methods are 
particularly useful to achieve simultaneous scanning of multiple tag complement regions. 



F Rpnrle.ring A Lab*l I Indetectable 
10 According to the first aspect of the method of the invention, once a signal from a label 

is detected, pnor to performing a subsequent cycle of discrete primer extension, the label is 

rendered undetectable. 

In one preferred embodiment, the label is rendered undetectable by cleaving the label 
15 off of the labeled extendible nucleotide incorporated in the pnmer extension product. The 
method used for cleaving the label from the nucleotide depends on the type of cleavable 
linkage used to l.nk the label and the nucleotide. See above. Exemplary cleavage reagents 
include thiol, base, periodate, hydroxylamine and the like. In one preferred method, the label 
is attached to a base-portion of a uracil nucleotide and subsequent to detection the label is 
20 cleaved off of the labeled extendible nucleotide by treatment with the enzyme urac.l N- 
glycosylase (UNG). 

In a second preferred embodiment, the label is rendered undetectable by destroying the 
signal-producing properties of the label itself. Depending on the type of label used, there are 

25 several methods which may be employed for destroying the signal-producing properties of the 
label. For example, if the label is a fluorescent dye, the label may be rendered undetectable by 
photobleaching the dye using an intense light source in an oxygen-rich environment. If the 
label is an enzyme, the label may be rendered undetectable by reacting the label with an 
irreversible enzyme inhibitor which renders the enzyme incapable of catalyzing a signal 

30 producing reaction. If the label is a chemiluminescent agent which can undergo only a single 
light-producing transformation, the label is autodistructing and thus does not require a separate 
reaction to render the label undetectable (Bronstein). 
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F Tag Com plement Arrays 

As noted above, the invention also includes embodiments that utilize primers or sample 
polynucleotides tha, contain tag sequences that uniquely tdemify .he attached pnmers or 
sample polynucleotides, winch can be immobilized on addressable arrays for analysts of 

5 multiple sample polynucleotides in parallel. 

A plurality of different-sequence primers are contacted with a polynucleotide sample 
under conditions effective for the pnmers to anneal to pnmer-complementary regions ,n one or 
more tareet polynucleotides, to form one or more target-pnmer hybrid(s), wherein either (1) 

10 each different-sequence primer contams (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
m ore polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a'nucleotide sequence that uniquely identifies the attached polynucleotide. One or 
more primer extension cycles are performed as described above, and the appearance or loss of 

1 5 signal is determined at the appropriate times to measure repeat lengths. 

Depending on the preferences of the user, the tagged pnmers or tagged sample 
polynucleotides can be contacted with an addressable array of immobilized, different-sequence 
tag complements which each contain a sequence that is complementary to one of the tag 
20 segment, under conditions effective to hybridize the tag segments to corresponding tag 
complements on the support. By way of illustration, embodiments using either tagged pnmers 
or tagged sample polynucleotides are shown in Figs. 4 and 5, respectively, after the tag 
segments have been hybridized to a solid phase support, and a target-primer hybrid has been 
formed to yield tertiary polynucleotide complexes. 

Fig 4 shows a ternary complex wherein a tag complement oligonucleotide 410 
immobilized on a solid phase support 400 is specifically hybndized to a tagged pnmer 420. 
Tagged primer 420 contains a target binding segment 422 and a tag segment 424. The vertical 
lines between the tag complement oligonucleotide 410 and tag segment 424 indicate 
30 complementary base-painng. Also shown is a sample polynucleotide 440 comprising a 
sequence 442 which hybridizes to target binding segment 422. and which, in the embodiment 
shown, terminates at a nucleotide immediately adjacent to a sample repeat region 444. Sample 
polynucleotide 440 optionally includes flanking sequences 446 and 448 which do not hybndize 
to the tagged primer. Once polynucleotide 440 and tagged pnmer 420 have formed a target- 
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primer hybnd, the hybrid can be treated as d.scussed in preceding sections to extend the 
extendable end of the primer into the repeat region of the sample polynucleotide. Extension 
cycles are repeated until the end of the repeat region has been reached, or the desired number 

of repeats have been counted. 

Fig 5 shows an alternative ternary complex wherein a tag complement oligonucleotide 
510 immobilized on a solid phase support 500 is specifically hybridized to a tagged sample 
polynucleotide 520. Polynucleotide 520 includes a tag segment 524 which is hybridized to 
oligonucleotide 510, and which is connected to a sample sequence 522. Sequence 522 includes 
a repeat region 528 which is flanked on either side by first and second sample segments 526 
and 530 An extendable pnmer 540 is annealed to one of sample segments 526 and 530 to form 
a target-primer hybrid such that the extendable end is adjacent to (or protrudes into) the sample 
repeat region 528. The hybrid can be treated as above to extend the extendable end of the 
primer into the repeat region of the sample polynucleotide, until the end of the repeat region 
1 5 has been reached. 

Tagged sample polynucleotides can be formed by any suitable method. Conveniently, 
tagged samples can be formed by PCR amplification using primer pairs comprising first and 
second target-specfic primers which flank each sample sequence of interest, wherem one of 
the probes contains an identifier tag segment. In one embodiment, the tagged pnmer « 
constructed so that the tag segment is not copied dunng amplification, either due to the 
presence of non-standard intemucleos.de linkages (e.g., PNA linkages), or due to the presence 
of a non-polynucleotide linker region which separates the tag segment from the target-bmdmg 
segment, as can be prepared by standard synthetic methods. After the tag segment is hybridized 
25 to a tag complement on an addressable array, the non-tagged strand can be removed from the 
tagged strand, e.g., by elevated temperature and/or reducing salt concentration to destabilize 
DNA-DNA duplex structure, particularly if either the tag melts from the tag complement at a 
higher temperature than the melting temperature between the target-binding segment of the 
pnmer and the sample polynucleotide. Many other approaches for obtaining a selected strand 
30 from duplex nucleic acids are known in the art, and include for example, (1) the use of a 
biotinylated primer in PCR to enable capture and separation of the non-tagged strand from the 
tagged strand under denaturing conditions, (2) the use of a PCR primer (non-tagged) that 
contains a short RNA segment which can be degraded with RNAse after PCR is complete, 
followed by destruction of the second RNAse degraded strand with an enzyme that has an 
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ri«t, DNAse) or an asvmmetric PCR method which 
exonuclease activity (e.g., an appropriate DNAse), 

favors amplification of the tagged primer strand. 

reference iu n 5 , extendable end of the pnmer 

526, between the tag segment and the repeat reg.on, such that the 

is again adjacent to sample repeat region 528. 

The tagged primers or tagged sample polynucleotides can be hybndized to a tag 

===== 

fo , contacttng step ,n „h,ch .he d*e* J ^ ^ ^ 

annealed to form target-pnmer hybrids. Alternatively,™ 
, 5 lhe flra pnm er _ -10, or a*e, ,e second ^J^^l 

case, a stna,, a„u,ot of the e«on <-°~ " pIovides Munt , ng 

cycle and hybridized to replicate arrays, such that each repi. 

information after each cycle. 

, , , T - TT ~ rnr pt? *,rTTrTN r - TUF METHOD 

20 ~ yrrSF0RPR ^ aJ& ' , . for carryin o out the vanous aspects and 

The present invention includes kits for carrying ou ..,,,„„ 

■ , B^tacnwt kits ofthe invention include a 

r a T:::;;::r:::~: :^ — - — — * 

be rendered undetectable following a treatment step effect.ve to cleave 

— P rod,t and,or destroy the ^, P~ « - * . 

Hybrid through the pnmer or the urge, nucleic acid. Optionally, the k„s 
the invention include a primer termination reagent. 
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In a second aspect, the kits of the invention include a primer, a first primer extension 
reagent, a second primer extension reagent, and a pnmer termmation reagent, wherein the 
primer termination reagent includes a nucleotide terminator having a label attached thereto. 
The second primer extension reagent and the pnmer termination reagent may be packaged 
either separately or together as a mixture. Preferably, the pnmer is bound to a solid support, 
or is capable of being bound to a solid support through a specific binding pair or through a 
covalent linking moiety. In another preferred embodiment, this aspect of the invention includes 
a solid-phase support for attachment of a target-primer hybrid through the primer or the target 
nucleic acid. 

in another embodiment, the kits can include (A) a plurality of different-sequence 
pnmers. each containing (i) a target binding segment and (ii) a tag segment having a nucleotide 
sequence that uniquely identifies the target binding segment; a first primer extension reagent; 
and a second primer extension reagent, wherein at least one of the first or second pnmer 
extension reagents includes an extendible nucleotide having a label attached thereto: and/or (B) 
an addressable array of immobilized, different tag complements, wherein each different tag 
complement contains a sequence that is complementary to one of the primer tag segments, 
under conditions effective to hybridize the tag segments to corresponding tag complements on 
the support. 

The invention may be further understood in light of the following examples, which are not 
intended to limit the invention. 

Example 

The following study illustrates exemplary sample hybndization conditions and 
extension cycles for counting repeats in four different sample sequences using an array of 
target-specific capture oligonucleotides immobilized on a solid support. 

A Support. Glass microscope slides were immersed in 1 N HN0 3 for 1 to 2 hours, 
followed bv nnsing with deionized water. Optionally, slides were soaked overnight in 5% HC1 
to improve long-term stability of subsequent functionalizizadon. Slides were then sonicated 
sequentially in the following three solvents for 10 minutes each: hexane, acetone, and ethanol, 
followed by drying in air. 
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A solution of 2% (v/v) am.nopropyltnethoxy silane (0.8 mL in 40 mL) in 95% 
acetone:5% water was prepared in a dtsposab.e pLastic Falcon tube. This amount of solution 
w« usuallv sufficient to coat at least three slides. The solution was allowed to stand for 5 to 
20 m,nutes to hydrolyze the ethoxy groups to hydroxyl groups. Air-dried slito were dipped n. 
5 the solution for 2 minutes each, and then nnsed by dipping in three or more successive acetone 
baths. 

The slides were cured in 100°C oven for 45 minutes. Cured slides were treated for 2 
hours with a 0.2% solution of 1 ,4-phenylene diisothtocyanate (PDITC) in 10% 

10 P yridine:dimethyl formamide, followed by washes in baths of methanol and acetone, and air- 
drying. The slides may be stored under vacuum with a dessicant at 4-C. Stacking the slides 
also helps preserve the nationalization and keeps the functinahzed surfaces free from 
particulates. (In an a.temat,ve embodiment, instead of 1 ,4-phenylene ditsothiocyanate, the 
cured slides can be treated with a 1 mM solution of EMCS (N-Ce-maleimido caproxyl) 

15 succinimide in methanolidimethylsulfoxide (80:20) for 2 hours.) 

B Tn^-^n »f future Olieos. Synthetic capture oligonucleotides were 
prepared by standard phosphoramidite synthetic methods. The capture oligonucleotides had the 
following general structure: S'-amino-KPEOhHcapture ol.go (24 „t)]. where PEO - -O- 
20 (CH,CH,0) 6 added via DMT-0-(CH 2 CH 2 0) 6 -phosphoramidite. The capture oligonuc.eot.des 
had polynucleotide sequences complementary to the double-underlined sequences in the four 
sample sequences shown in section C below. 

The capture oligos were spotted in a rectangular array pattern onto slides prepared as 
25 above ustne an ASYMTEC robotic loader (Asymtec, Carlsbad, CA, Model No. C-708). w,th 
oligonucleotide concentrations of about 50 uM (in 100 mM Na 2 CO, P H 9.0) and spotting 
volumes of about 0.2 to 1 nL. The pattern included a separate set of 8 spots (2 x 4 rows) for 
each different capture oligo to provide redundancy. The spots had diameters of about 200 
micrometers and were spaced apart by 320 micrometers center-to-center. The robotic loader 
30 included a sl.de holder to hold the slides during oligonucleotide deposition, and a blotting area 
for cleaning dispenser tips. 
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After spotting, the capture oligos were fixed onto the slides by incubating the slides in a 
humidity chamber at room temperature for 60 minutes, followed by soaking in 1% NH4OH 
(aq) for 20 minutes and in 0.2% casern in TBS (50 mM Tns. 150 mM N.C1, P H 8) for 20 
minutes. The slides were washed in deionized water and stored at 4°C. 

C c., r ,.»^,.t;nn,nd Analysis. Four different sample oligos were prepared to 
represent four different microsate.lite alleles. In brief, the sample polynucleotides included the 
following sequences, where single-underUmng indicates repeat regions, and double underlining 
indicates segments for hybridizing to complementary capture oligos on the support: 

Sam ple OligQ "fin_?r, {SF.O ID NO:l) 

5 , GTr y^r aa a gtg ATTTG ATGT A G ATTTTGA-3 

Samnle Qlieo 1 All .1 H (SF.O ID NO:2) 

r r,Tr - 1 nn A A T A M r - TnTA A A agtgt ATGAT-3' 

Sam ple Oligo ? i (SF.O ID NO:3) 

-. r>Tr , . r *, PTGT A T^f A T A a Anr.ATTTTGATTGA-3' 

Samnle Oliac ao^-AT (SF.O ID NQ.4) 

A mixture of the four sample oligos (0.33 uM each in IX PGR buffer containing 1.5 
mM MgCh, 50 mM KC1, 1.5 mM MgCl 2 , 10 mM Tris-HCl, pH 8.3, and 0.001% (w/v) gelatin) 
was placed over the capture oligos on the array and allowed to incubate for 1 hour at room 
temperature. After the incubation, unbound oligo. were removed with IX TE (10 mM Tns, 
pH 8 1 mM EDTA) containing 50 mM NaCl. Replicate slides were subjected to different 
numbers of extension cycles (one, two, three or four cycles). Each extension cycle (also called 
a "reagent cycle") involved the following steps: 

1) Incubate slide for 4 minutes at 37°C in an extension mixture containing: 
500 uM dGTP 

20 uM Big Dye R6G ddATP (Perkin-Elmer, Foster City, CA, Emax = 560 nm) 
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20 uM Big Dye dRox ddCTP (Perkin-Elmer, Foster City. CA, Emax = 61 5 nm) 
1 U/uL AMPLITAQ FS (plus 0.125 U/uL pyrophosphatase) 
IX buffer (80 mM Tris, pH 9.0, 2.5 mM MgCh) 

2) Rinse slide with IX TE containing 50 mM NaCl, then wash 3 times by Emersion in same 
solution. 

3) Incubate slide for 4 minutes at 37°C in an extension mixture containing: 

500 uM dATP 

?0 uM Big Dye R6G ddATP (Perkin-Elmer, Foster City, CA, Emax = 560 nm) 
20 uM Big Dye dRox ddCTP (Perkin-Elmer, Foster City, CA, Emax = 615 nm) 

1 U/uL AMPLITAQ FS (plus 0.125 U/uL pyrophosphatase) 

IX buffer (80 mM Tris, pH 9.0, 2.5 mM MgCl 2 ) 

4) Rinse slide with IX TE contammg 50 mM NaCl, then wash 3 times by immersion in same 

solution. 

After 4 extension cycles had been competed (each cycle included steps 1 through 4), each 
slide was overlaid with a viewing buffer con,a,ning 50% (w:v) urea and IX TBE (0.09 M 
Tris-borate, 2 mM EDTA. pH - 8.3) .0 provide an alkaline pH environmem to enhance the 

fluorescence emissions of the dye labels. 

The sl.des were each viewed using an imaging device se, for fluorescence emission 
detection a. 560 nm and 6,0 nm. The device was an imagtng fluorimeter that produces a two- 
dimensional image array of emission mtensities (electronic image). Each po,n, in the , mage 
arrav corresponds ,0 a physical location on the sample sl.de. The excitatton source ,s a 40 mW 
argon ion laser. The excitatton wavelength can be selected from either of two natural laser 
lines (488 nm and/or 514 nm). The selection of laser wavelengths is accomplished by passmg 
the beam which contains light of both wavelengths through one of three filters. The selected 
filter wil, pass either 488 nm on.y, 514 run only, or both 488 and 514 nm. The filters are 
mounted on a moter driven platform so that ,h. selection can be performed under computer 
control. The beam containing either or both wavelengths passes through a classical beam 
expander. The collimated expanded beam is then directed into a commercially avatlable 
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assembly (Scanlab. Puchhein, Germany, Model SK1020) containing two orthogonal "galvo 
mirrors" This is a mechanical device designed to rotate each of the two mirrors quickly and 
precisely over a small angle. The axes of rotation are orthoganal and independent so that the 
beam can be rastered over a rectangular pattern, also under computer control. 

The scanned beam is focused by an f-theta lens which forms a scanned laser spot at one 
focal length. The spot size is 20 urn diameter. Typically, it is stepped over an area of 20 mm x 
20 mm in increments of 20 urn, thereby forming a 1000 x 1000 array of pixels. The focused 
laser spot is formed after passing through a dichroic beam splitter which is designed to 
reflected the laser excitation wavelengths but transmit the fluorophore emission. 

As the laser beam steps across a fluorescent region, the fraction of fluorescence 
radiation within the solid angle of collection of the emission optics is transmitted by the 
dichroic mtrror and i. collated by a lens having a 150 mm focal length. The col.imated 
beam is then filtered by a long pass interference filter which further rejects laser light. A 
second lens of 75 mm focal length focuses the beam onto the photocathode of a red sensitive 
photomultiplier tube (PMT). The PMT photocathode does not need to be carefully adjusted 
since the focus is not critical. A filter wheel in front of the PMT allows only a small 
wavelength band (10 nm) to reach the detector. The filter wheel can be adjusted to one of stx 
positions to allow for emission of multiple fluorophores to be discriminated. 

The electncal output of the PMT is proportional to the intensity of the light reaching 
the photocathode. The computer system is capable of stonng the Signals at each of an array of 
mirror positions, to form the electronic image. 

The results were as expected. Specifically, for oligo 1 (SEQ ID NO:l), no signal was 
observed until completion of the second cycle, after which two GT dimers and a fluorescence- 
labeled ddC terminator had been appended to the immobilized capture oligo, which also served 
as the extendable primer. Similarly, for ol.gos 2 through 4 (SEQ ID NO:2 through 4), the 
30 appropriate fluorescent signal was observed after 1, 2 and 4 extension cycles, respectively. 

All publications and patent applications refered to in this disclosure are herein 
incorporated bv reference to the same extent as if each individual publication or patent 
application was specifically and individually indicated to be incorporated by reference. 
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Although only a few embedments have been desenbed in detail above, those having 
ordinary skill in the molecular biology art will clearly understand that many modifications are 
possible in the preferred embodiments w.thout departing from the teachings thereof. 
According all such modifications are intended to be encompassed within the scope of the 



following claims. 
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WE CLAIM: 

1 . A method for determining the number of repeat units in a repeat region of a target 

nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for. said primers to anneal to primer-complementary regions in one 
or more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on said hybrid(s) using a first primer 
extension reagent; 

(C) separating the target-primer hybrid(s) and unreacted first primer extension reagent; 

(D) performing a second primer extension reaction on said hybrid(s) using a second 
primer extension reagent, wherein at least one of the first or second pnmer extension reagents 
includes an extendible nucleotide having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 

reagent; 

(F) measuring a signal produced by the label; 

(G) treating the label so as to render the label undetectable; and 

(H) repeating a cycle of steps (A) through (G) until the signal detected in the target- 
primer hybrid(s) is substantially less than a signal detected in a previous cycle, 

wherein (I) prior to step (F), at least an aliquot of either (1) said different-sequence 
primers or (2) said tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of said tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

2. The method of claim 1 wherein step (D) further includes reacting the target-primer 
hybrid(s) with a primer termination reagent. 
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3. The method of claim 1 wherein each different-sequence primer contains (i) a target 
binding segment and (ii) a tag segment having a nucleotide sequence that uniquely identifies 
the target binding segment. 

5 4. The method of claim 3 wherein the contacting in step (I) is performed prior to step 

(A). 



5. The 



method of claim 3 wherein the contacting in step (I) is performed after step (A). 



10 6. The method of claim 3 wherein said method is performed on at least two replicate 

arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 
through (G) than is a second replicate array. 

7. The method of claim 1 wherein one or more polynucleotides in the sample are tagged 
15 polynucleotides that contain a tag segment having a nucleotide sequence that uniquely 

identifies the attached polynucleotide. 

8. The method of claim 7 wherein the contacting in step (I) is performed prior to step 

(A). 

20 

9. The method of claim 7 wherein the contacting in step (I) is performed after step (A). 

10. The method of claim 7 wherein said method is performed on at least two replicate 
arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 

25 through (G) than is a second replicate array. 

1 1. The method of claim 1 wherein the label is selected from the group consisting of 
fluorescent and chemiluminescent molecules. 

30 12. The method of claim 1 wherein the label is attached to the extendible nucleotide 

through a cleavable linker. 

13. The method of claim 1 wherein the target nucleic acid is amplified prior to 
analysis. 
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14. The method of claim 13 wherein amplification is achieved using a PCR. 

15. The method of claim 1 wherein the step of treating the label so as to render the label 
undetectable includes cleaving the label from the labeled extendible nucleotide. 

16. The method of claim 1 wherein the step of treating the label so as to render the label 
undetectable includes destroying a signal-producing property of the label. 

17. A method for determining the number of repeat units in a repeat region of a target 
nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
under conditions effective for said primers to anneal to primer-complementary regions in one 
or more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on said hybrid(s) using a first primer- 
extension reagent; 

(C) separating the target-primer hybrid(s) from unreacted first primer extension 
reagent; 

(D) performing a second primer extension reaction on said hybrid(s) using a second 
primer extension reagent and with a primer termination reagent, the primer termination reagent 
including a nucleotide terminator having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
reagent and unreacted primer termination reagent; 

(F) measuring a signal produced by the label; and 

(G) repeating a cycle of steps (A) through (F) until a signal is detected indicating 
incorporation of the nucleotide terminator, 

wherein (H) prior to step (F), at least an aliquot of either (1) said different-sequence 
primers or (2) said tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
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sequence that is complementary to one of said tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

18. The method of claim 17 wherein each different-sequence primer contains (i) a 
5 target binding segment and (ii) a tag segment having a nucleotide sequence that uniquely 

identifies the target binding segment. 

19. The method of claim 18 wherein the contacting in step (H) is performed prior to 
step (A). 

10 

20. The method of claim 18 wherein the contacting in step (H) is performed after step 

(A). 

21. The method of claim 18 wherein said method is performed on at least two replicate 
1 5 arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 

through (F) than is a second replicate array. 

22. The method of claim 17 wherein one or more polynucleotides in the sample are 
tagged polynucleotides that contain a tag segment having a nucleotide sequence that uniquely 

20 identifies the attached polynucleotide. 

23. The method of claim 22 wherein the contacting in step (H) is performed prior to 
step (A). 

25 24. The method of claim 22 wherein the contacting in step (H) is performed after step 

(A). 

25. The method of claim 22 wherein said method is performed on at least two replicate 
arrays, and one of the replicate arrays is subjected to at least one more cycle of steps (A) 

30 through (F) than is a second replicate array. 

26. The method of claim 17 wherein the label is selected from the group consisting of 
fluorescent and chemiluminescent molecules. 
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27. The method of claim 17 wherein the target nucleic acid is amplified prior to 
analysis. 

28. The method of claim 27 wherein amplification is achieved using a PCR. 

5 

29. A kit useful for determining the number of repeat units in a repeat region of a target 

nucleic acid comprising: 

a plurality of different-sequence primers, each containing (i) a target binding segment 
and (ii) a tag segment having a nucleotide sequence that uniquely identifies the target binding 

10 segment, 

a first primer extension reagent; and 

a second primer extension reagent, wherein at least one of the first or second primer 
extension reagents includes an extendible nucleotide having a label attached thereto. 

15 30. The kit of claim 29, which further includes an addressable array of immobilized, 

different tag complements, wherein each different tag complement contains a sequence that is 
complementary to one of said primer tag segments, under conditions effective to hybridize the 
tag segments to corresponding tag complements on the support. 

20 31. The kit of claim 29 wherein the label is selected from the group consisting of 

fluorescent and chemi luminescent molecules. 

32. The kit of claim 29 wherein the label is attached to the extendible nucleotide 
through a cleavable linker. 
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